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ABSTRACT 


-  _ A  sequence  of  11,000  waves  that  passed  an  ODGP  platform  during  Hurricane 
Camille  is  studied  as  55  successive  samples  of  200  waves  each  to  determine 
whether  or  not  the  Rayleigh  probability  density  function  fits  the  crest  to 
trough  heights  defined  in  the  conventional  manner  and  whether  or  not  the 
highest  wave  in  the  sample,  which  was  72.2  feet  from  crest  to  trough,  was 
highly  improbable  in  terms  of  extreme  value  theory. 

Five  reasons  why  a  sample  of  wave  heights  can  depart  from  the  theoretical 
Rayleigh  distribution  are  analysed.  Two  are  theoretical;  two  have  to  do  with 
sampling  problems;  and  the  last  is  concerned  with  spectral  estimation 
procedures.  It  was  not  possible  to  isolate  each  of  the  five  reasons.  The 
results  do  however  show  that  if  the  last  three  are  taken  into  account,  the 
Rayleigh  distribution  comes  close  to  describing  the  samples  adequately. 


The  highest  waves  that  occurred  were  not  unusual  when  tested  against 


extreme  value  theory. 
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There  are  some  indications  that  the  significant  wave  height  in  a 

u 

random  sea  is  not  correctly  predicted  by  4(mo)^  where  mQ  (or  Eq,  or  P0 
depending  on  the  author)  is  the  variance  of  the  random  waves  and  the  waves 
are  thought  of  as  passing  a  fixed  point  as  a  function  of  time.  Even  more, 
this  implies  that  the  crest  to  trough  wave  heights  do  not  have  a  Rayleigh 
probability  density  function.  In  particular,  if  Eq  is  the  area  under  the 
variance  spectrum,  the  pdf 

f(h)  =  y-jr-  exp(-h2/8EQ) 
o 

and  the  cumulative  density  function  (cdf) 

F (h)  =1  -  exp  (-h2/8EQ) 

do  not  appear  to  describe  the  waves  very  well. 

The  short  crested  Gaussian  (and  hence  linear)  seaway  has  served  its 
purpose  well  as  a  first  step  in  departing  from  the  oversimplified  models 
of  three  of  four  decades  ago, and  many  of  the  concepts  can  be  and  are  still 
being  used  for  practical  applications.  Waves,  nevertheless,  are  nonlinear, 
and  over  the  past  decade,  or  so,  there  have  been  many  attempts  to  treat  non¬ 
linear  aspects  of  waves  combined  with  other  attempts  to  treat 
linear  models  without  the  restrictive  assumptions  implied  by  the  Rayleigh 
pdf.  A  review  of  much  that  is  known  about  the  probabilistic  and  statistical 
properties  of  waves  has  been  given  by  Ochi  (  1982)  .  Substantial  advances 
have  been  made  since  the  work  of  Rice  (  1944),  and  much  of  the  linear  theory 
still  has  useful  applications. 

If  equations  (1)  and  (2)  are  not  good  enough  to  describe  ocean  wave 
heights  with  sufficient  accuracy,  and  if  in  particular 

hi/3  -  "'V15 

is  an  incorrect  equation  and  the  average  of  the  heights  of  the  one  third 
highest  waves  in  an  ocean  wave  time  history  is  not  related  to  the  wave 
spectrum  in  this  simple  way,  then  better  models  must  be  found. 

The  questions  that  naturally  arise  if  equations  (1)  to  (3)  are  inadequate 

are: 


Cl) 


(2) 
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1)  What  probability  density  functions  can  be  used  in  place  of 
equation  (1)  and  under  what  circumstances  are  they  applicable'.’ 

2)  How  can  it  be  proved  by  means  of  a  sample  of  crest  to  trough 
wave  heights  from  a  wave  record  that  a  replacement  pdf  is  superior  to 
equation  (1)  and  that  it  gives  better  predictions  for  the  wave  heights; 

3)  An  extreme  value  theory  based  on  equation  (1)  has  been  developed. 
If  equation  (1)  does  not  apply,  what  are  the  implications  with  reference 
to  the  prediction  of  extreme  waves? 

Historical  records  are  replete  with  reports  ,  photographs  and  recordings 
of  extreme  waves.  Coles  (1966)  describes  them  as  "freak"  waves  and  refer: 
to  them  19  times  in  a  304  page  book  with  end  plates  that  look  remarkably  like 
some  figures  from  Longuet-Higgins  and  Cokelet  (1976) .  "Freak"  troughs  are 
described  by  Lawrence  Draper  in  an  appendix  to  Cokelet' s  book.  A  partial 
quote  from  this  appendix  follows  where  dashes  show  parts  of  the  full  text 
have  been  omitted. 

"Reports  of  freak  waves  usually  concern  waves  with  unexpectedly  high  crests, 
as  in  most  of  the  instances  described  in  previous  chapters,  but  there  is  just 
as  much  chance  of  an  unusually  low  trough  occurring.  The  reason  why  they  are 
not  often  reported  must  be  that  a  high  crest  can  be  seen  from  a  large  distance, 
but  a  vessel  would  have  to  be  on  the  very  edge  of  a  deep  trough  to  notice  it. 

Two  reports  of  deep  troughs  (at  least,  that  is  what  we  believe  them  to  have 
been)  were  described  in  the  Marine  Observer  under  the  heading  "The  one  from 
nowhere".  The  following  is  an  extract  from  the  report  of  Commodore  V.  S. 

Byles,  R.  D.,  master  of  the  Edinburgh  CuStle,  and  a  similar  report  of  a 
wartime  experience  by  Commander  I.  R.  Johnston,  R.  N.  (Retired),  appearing 
in  that  article:"  - 

"To  further  ensure  that  no  untoward  incident  should  occur,  1  took 
a  knot  off  her  speed  and,  to  close  the  coast  I  had,  of  course,  put  the 
swell  cosily  on  the  bow  instead  of  driving  into  it  head-on.  Under  these 
conditions  she  was  very  confortable  for  three-quarters  of  an  hour  or  ^o. 

The  distance  from  one  wave  top  to  the  next  was  about  150  ft.  and  the 
ship  was  pitching  and  scending  about  10-15  degrees  to  the  horizontal. 

And  then  it  happened.  Suddenly,  having  scended  normally,  the  wave 
length  appeared  to  be  double  the  normal,  about  300  ft.,  so  that  when 
she  pitched  she  charged,  as  it  were,  into  a  hole  in  the  ocean  at  an 


angle  of  30  degrees  or  more,  shovelling  the  next  wave  on  board  to  a 
height  of  IS  or  20  ft.  before  she  could  recover,  as  she  was  "out  of 
step"." 

"Commodore  Byles's  article  was  reported  in  the  national  Press  and  brought 
the  following  comments  from  Commander  I.  R.  Johnston,  R.N.  (Retired):"  - 

"We  were  about  100  miles  south-south-west  of  Durban  on  our  way  to 
Cape  Town,  steaming  fast  but  quite  comfortably  into  a  moderate  sea  and 
swell  when  suddenly  we  hit  the  "hole"  and  went  down  like  a  plummet  into 
the  next  sea  which  came  green  over  A  and  B  turrets  and  broke  over  our 
open  bridge.  I  was  knocked  violently  off  my  feet,  only  to  recover  and 
find  myself  wading  around  in  2  ft.  of  water  at  a  height  of  60  ft.  above 
normal  sea-level." 

Buckley  (1983)  refers  to  extreme  waves  as  "episodic"  waves  and  has 
collected  a  large  number  of  reports  on  the  effects  of  these  waves  on  ships 
structures.  He  also  reproduces  the  time  history  records  of  five  of  the 
extreme  waves  recorded  during  Hurricane  Camille. 

Questions  that  arise  in  connection  with  extreme  (and  rare)  high  waves 

are: 


1)  How  do  they  form? 

2)  Are  they  a  natural,  and  explainable,  consequence  of  the 
randomness  of  the  waves? 

3)  How  must  mathematical -probabilistic  models  of  a  seaway  be 
corrected  to  account  for  them? 

The  problems  of  a  replacement  (or  replacements)  for  the  Rayleigh  pdf 
and  of  the  prediction  and  description  of  extreme  waves  raise  more  questions 
than  can  be  answered  completely.  However,  both  presently  available 
experimental  data  and  aspects  of  newly  developed  theories  (which  can  be 
shown  to  lead  to  some  strange  contradictions)  point  the  way  to  some  partial 
answers  and  some  tentative  conclusions. 
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DATA  FROM  HURRICANE  CAMILLE 


Ocean  wave  time  histories  obtained  by  the  ODGP  program  during  Hurricane 
Camille  are  studied  in  terms  of  the  series  of  crest  to  trough  heights  so  as 
to  determine  if  they  do  or  do  not  form  an  independent  random  sample  and  how 
well  the  Rayleigh  probability  density  function  fits  the  sample  wave  heigV. 
distribution  and  how  well  extreme  value  theory  predicts  the  highest  waves  in 
samples  200  consecutive  waves. 

Hurricane  Camille  was  one  of  the  most  thoroughly  studied  hurricanes 
in  history,  and  the  data  on  waves  from  it,  as  gathered  from  the  Ocean  Data 
Gathering  Program  (ODGP)  of  a  consortium  of  oil  companies  has  had  extensive 
study.  Examples  of  research  and  papers  on  the  waves  in  this  hurricane  are 
given  by  Ward  (1974),  Ward,  et  al .  (1978),  Cardone,  Pierson  and  Ward  (1976), 

Spring  (1978)  and  Buckley  (1983).  The  study  of  the  probability  density 

function  of  wave  heights  by  Spring  (1978)  showed  that  even  the  additional 
freedom  to  choose  a  value  other  than  Eq  could  not  provide  a  better  fit  and 
that  minor  empirical  variants  of  the  Rayleigh  pdf  did  not  do  much  better. 
This  present  study  is  a  continuation  of  research  completed  in  March  1982 
for  DTNSRDC  (Pierson  and  Salfi  (1982)  (unpublished)).  Pertinent  results 
from  that  study  are  included.  Many  of  our  results  are  superceded  by  this 
present  study.  This  present  study  parallels  much  of  the  difficult  and  time 
consuming  work  done  by  Spring  in  the  development  of  computer  programs  to 
describe  the  heights  and  periods  of  the  waves.  Some  of  the  spectral  wave 
properties  at  ODGP  Station  1,  as  a  function  of  time  as  Camille  approached 

that  offshore  oil  rig,  were  also  studied  by  Spring.  We  provide  a  brief 

review  of  some  of  the  results  given  by  Spring,  before  proceeding  with  our 
own  particular  analysis.  It  was  possible  to  obtain  data  that  were  in  a 
form  very  similar  to  that  of  Spring,  but  the  data  were  interpreted  in  a 
somewhat  different  way  in  terms  of  the  theoretical  derivations  as  given  in 
the  following  sections  of  this  paper. 

One  of  the  first  steps  in  the  analysis  of  Spring,  was  to  break  the 
continuous  22.5  hour  time  history  of  the  rise  and  fall  of  the  sea  surface 
at  Station  1,  into  samples  of  approximately  7.5  minutes  in  duration. 

Table  1  of  this  study  is  a  copy  of  Table  4  from  the  work  of  Spring,  in 
which  the  7.5  minute  segments  were  treated  as  a  total  of  180  records 
throughout  the  duration  of  the  recorded  data.  Only  the  first  five  columns 
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of  the  table  produced  by  Spring,  are  reproduced  because  the  remaining  data 
in  his  analysis  were  specific  to  his  particular  method  of  study.  Table  1 
(of  this  study)  gives  the  record  number,  the  significant  wave  height,  computed 
as  the  average  of  the  one-third  highest  crest  to  trough  wave  heights  in  the 
time  history,  the  spectral  width  parameter,  to  be  defined  later,  the  number 
of  waves,  and  the  significant  wave  height  computed  from  4.005  times  the 
square  root  of  the  area  under  the  variance  spectrum.  (Forrestall  (1978)). 

As  can  be  seen  from  inspection  of  this  table,  the  significant  wave 
height  computed  from  the  data  was  generally  less  than  the  significant 
wave  height  computed  from  the  area  under  the  spectrum.  The  number  of  waves 
decreases  from  96  waves  in  Record  Number  1  to  39  waves  in  Record  Number  180, 
at  the  peak  of  the  storm  measured,  or  counted,  during  the  7.5  minute  sub¬ 
sample.  The  spectral  width  parameter  is  highly  erratic,  from  one  record  to 
the  next . 

A  sample  of  7.5  minutes  duration  can  yield  highly  variable  estimates 
of  spectra,  and  although  the  number  of  waves  during  the  early  part  of  the 
total  record  is  fairly  high,  there  do  not  appear  to  be  enough  waves  in  the 
7.5  minute  samples  toward  the  end  of  the  series  under  study  to  yield  an 
adequate  sample. 

Table  2  of  this  paper  reproduces  Table  5  from  Spring  in  much  the  same 
way  as  the  proceeding  Table  1  reproduced  Table  4.  Here  the  records  are 
fifteen  minutes  long,  and  there  are  half  the  number  of  records  compared  to 
the  preceeding  data.  Again  the  significant  height  computed  from  this  new 
larger  sample,  the  spectral  width  parameter  and  the  significant  height 
computed  from  the  area  under  the  spectrum  are  given.  It  is  noted  that 
Records  1  and  2  of  Table  1  contain  192  waves,  whereas  Record  1  of  Table  2 
contains  195  waves,  or  three  more  waves  than  Records  1  and  2  of  Table  1. 

These  descrepancies  have  to  do  with  the  assumptions  made  at  the  ends  of 
each  sample  record  when  it  was  divided  into  15  minute  versus  7.5  minute 
samples.  For  the  shorter  samples  of  the  waves  the  ends  of  the  small 
samples  would  be  omitted  if  they  did  not  go  through  a  complete  cycle. 

Again  it  is  noted  that  the  significant  height  computed  from  the  area  under 
the  spectrum  tends  to  be  larger  than  the  significant  height  computed  from 
the  actual  data. 

Finally,  Table  3,  which  is  Table  6  from  Spring  shows  the  results  of 
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samples  of  30  minute  duration.  Record  1  for  example,  contains  393  waves,  as 
opposed  to  387  waves  for  combined  Records  1  and  2  of  Table  2.  The  same  thing 
is  happening  as  the  data  are  pooled  in  even  larger  samples.  Again  the 
significant  wave  height  computed  from  actual  wave  heights  tends  to  be  smaller 
than  the  height  computed  from  the  area  under  the  spectrum. 

Figures  1  and  2  are  graphs  of  the  significant  wave  heights  as  a  function 
of  time  as  plotted  from  Tables  1.  2  and  3.  Figure  1  shows  the  significant 
wave  height  computed  from  the  7.5  minute  samples.  Unless  indicated  by  x’  , 
the  upper  value  as  a  dot  is  the  significant  wave  height  computed  from  the 
area  under  the  spectrum,  and  the  lower  value,  as  shown  by  a  dot,  is  the 
significant  wave  height  computed  from  the  7.5  minute  sample.  There  are  four 
records  for  which  the  reverse  situation  occurred,  and  the  significant  wave 
height  from  the  spectrum  was  lower  than  the  value  from  "bump  counts".  These 
are  show  by  x's.  This  graph  does  not  start  at  the  smallest  values  of  the 
significant  wave  height.  It  starts  at  a  somewhat  later  time  than  the  tabulated 
data  in  order  to  fit  it  conveniently  to  the  graph  paper  that  was  used.  The 
start  is  Record  Number  31. 

Figure  1  also  shows  some  of  the  effects  of  sampling  variability  from  one 
7.5  minute  sample  to  another;  the  fluctuations  between  Record  110  and  130, 
Record  121  in  particular,  and  the  sharp  jump  from  Record  135  to  136,  are 
indicative  of  an  effect  of  small  sample  size.  As  the  waves  get  higher,  and 
the  number  of  waves  in  the  7.5  minute  samples  decrease,  the  irregularities 
become  very  pronounced.  For  example.  Record  180  indicates  a  significant  wave 
height  of  50.5  feet,  for  one  method  of  analysis  and  52.5  feet  for  the  other 
method  of  analysis.  The  one  7.5  minutes  earlier  indicates  significant  wave 
heights  of  33.5  and  35.5  for  the  same  conditions  for  the  same  record  length. 

The  change  is  rather  dramatic. 

The  data  for  the  15  minute  samples  and  the  30  minute  samples  are 
combined  in  Figure  2.  The  fifteen  minute  samples  are  shown  below  and  the 
thirty  minute  samples  are  shown  above,  with  a  displaced  coordinate  system. 

Tn  both  cases  the  significant  wave  height  approaches  a  value  slightly  over 
forty  feet. 

Figure  3  is  a  graph  of  the  spectral  width  parameter  for  successive  15 
minute  samples.  It  varies  very  erratically.  A  five  point  running  mean  of 
of  the  spectral  width  parameter  is  shown  with  the  scale  on  the  right. 
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FIGURE  1  Significant  Wave  Height  Versus  Time  for  7.5  Minute  Samples. 
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FIGURE  3  Spectral  Width  Parameter  as  a  Function  of  Time  for  15  Minute  Samples. 
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TABLE  1A  Seven  and  One  Half  Minute  Samples  from 
Hurricane  Camille;  H_  Significant  Height  from  Data; 


SWP,  Spectral  Width  Parameter: 
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TABLE,  IB  Continuation  of  Table  LA,  Records  36  to  70 
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36 

7.  18 

.483398 

59 

8.72 

37 

6.78 

.425596 

72 

8.08 

38 

6.77 

.305556 

75 

7.67 

39 

7.29 

.550815 

63 

8.  ! 5 

40 

6.57 

.4  14307 

75 

7.55 

41 

7.22 

.389648 

75 

7.99 

42 

6.70 

.402  168 

75 

7.54 

43 

7.99 

.565270 

60 

9.23 

44 

8.  10 

.483398 

69 

8.94 

45 

7. 13 

.472465 

69 

7.95 

46 

7.86 

.449038 

72 

8.63 

47 

9.23 

.463359 

63 

10.34 

48 

7.74 

.248889 

78 

8.67 

49 

7.32 

.426079 

75 

9.0! 

50 

9.39 

.566330 

54 

10.23 

51 

8.15 

.531073 

63 

9.22 

52 

9.26 

.437500 

63 

9.96 

53 

8.89 

.570749 

57 

9.65 

54 

9.50 

.473977 

66 

10.76 

55 

9.15 

.531073 

63 

9.75 

56 

10.30 

.4  1  1033 

66 

1  I  .67 

57 

9.31 

.564408 

66 

10.03 

58 

9.71 

.370987 

59 

10.64 

59 

11.19 

.523379 

57 

I  1  .74 

60 

9.6! 

.349635 

75 

10.54 

61 

10.31 

.332653 

66 

I  1  .30 

62 

10.26 

.987474 

63 

1  1  .09 

63 

9.30 

.495353 

65 

10.28 

64 

9. 16 

.54!  103 

63 

10.81 

65 

10. 17 

.423364 

63 

11.17 

66 

8.62 

.504269 

69 

10.  16 

67 

9.09 

.363400 

75 

10.71 

68 

9.42 

.376731 

75 

10.70 

69 

10.06 

.365201 

69 

l!.3i 

70 

9.93 

.924495 

66 

10.9! 

11 


TABLE  1C  Continuation  of  Table  1A,  Records  71  to  105 


REC 

H 

S 

SW3 

WAVES 

71 

10.44 

.545512 

60 

1 1 .08 

72 

10.62 

.425069 

69 

11.60 

73 

9.80 

.424495 

66 

1 1 .25 

74 

1  1 .34 

.367689 

66 

12.35 

75 

1  1  .23 

.437500 

63 

12.47 

76 

10.0! 

.425596 

72 

11.02 

77 

10.98 

.41  1033 

66 

12.26 

78 

1  !  .46 

.450657 

63 

12.45 

79 

1  I  .49 

.423169 

60 

12.48 

30 

10.91 

.501730 

60 

11.85 

31 

10.90 

.409726 

63 

12.1  1 

82 

9.74 

.397093 

66 

1  1 .47 

83 

(0.89 

.274343 

69 

12.40 

■34 

!  i  .74 

.5  10000 

63 

12.99 

85 

1  1.95 

.524376 

60 

12.94 

86 

12.05 

.570749 

57 

13.03 

37 

10.92 

.464604 

60 

12.07 

83 

12.23 

.555556 

50 

13.53 

39 

1  1 .84 

.463359 

63 

12.89 

90 

1  1 .66 

.437474 

63 

13.50 

91 

12.33 

.376731 

60 

13.20 

92 

12.81 

.423169 

60 

14.05 

93 

12.0! 

.450557 

63 

13.38 

94 

1  1  .63 

.535124 

60 

12.46 

95 

10.48 

.274348 

69 

1  1 .99 

96 

12.53 

.450557 

63 

13.95 

97 

13.06 

.53954  I 

57 

14.94 

93 

1  1.50 

.437500 

63 

12.37 

99 

1  1 .99 

.539541 

57 

13.58 

100 

12.93 

.526379 

57 

14.90 

101 

13.91 

.464404 

60 

14,41 

102 

12.90 

.379844 

63 

14.04 

103 

14.34 

,364044 

63 

14.45 

104 

14.80 

.495152 

54 

15.76 

105 

14.05 

.463359 

63 

14,59 

TABLE  10,  Continuation  of  Table  1A,  Records  106  to  140. 


REC 

H 

RH 

_ S_ 

S  WP 

t WAVES 

S 

106 

14.56 

.475624 

63 

16.26 

107 

13.47 

.50480! 

57 

15.70 

108 

14.  16 

.437500 

60 

15.81 

109 

14. 12 

.463359 

60 

15.32 

1  10 

15.79 

.373264 

57 

17.43 

1  1  1 

14.20 

.492344 

57 

16.83 

1  ,2 

13.70 

.390317 

57 

15.31 

1  13 

16.26 

.465976 

57 

17.54 

1  14 

15.07 

.504801 

57 

15.56 

1  15 

14.47 

.422400 

57 

16.60 

1  16 

17.79 

.5!  1916 

5! 

18.68 

1  17 

15.53 

.525018 

5 ! 

16.87 

1  18 

15.56 

.5081809 

54 

17.43 

I  19 

17.91 

.51  1916 

51 

19.  14 

120 

15.68 

.421543 

54 

17.14 

121 

21.10 

.419501 

48 

21.89 

122 

17.24 

.453686 

51 

18.90 

123 

18.  13 

.359377 

54 

19.43 

124 

19.00 

.525018 

51 

20.84 

125 

16.81 

.297362 

57 

13.57 

126 

19.63 

.549688 

51 

20.75 

127 

20.  15 

.525018 

51 

21  .53 

128 

22.90 

.555556 

48 

24.81 

129 

20.42 

.437500 

51 

23.54 

130 

21.56 

.537600 

51 

22.56 

131 

22.45 

.453656 

51 

24.25 

132 

20.44 

.4536S5 

4  1 

22.77 

133 

21.81 

.542948 

48 

22.69 

134 

22.77 

.529796 

48 

25.00 

135 

20.91 

.574569 

45 

23.45 

136 

27.17 

.562067 

45 

27.92 

137 

26.00 

.520710 

45 

27.44 

138 

29.43 

.542948 

48 

25.32 

139 

28.24 

.520710 

45 

30.54 

!  40 

24.01 

.555556 

43 

26.91 
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TABLE  IE,  Continuation  of  Table  1A,  Records  141  to  175 


v 


» 


REC 

Hs 

SWP 

"WAVES 

RHs 

141 

28.99 

.525934 

42 

30.57 

142 

29.4  1 

.419501 

48 

32.  1  1 

143 

25.91 

.574669 

45 

28.84 

144 

29.94 

.618512 

42 

32.  19 

145 

39.07 

.493249 

42 

39.81 

146 

26.93 

.562067 

45 

23.33 

147 

24.98 

.473205 

45 

28.3 

148 

29.  1  1 

.535124 

45 

32.05 

149 

33.06 

.455791 

45 

35.77 

150 

25.47 

.469184 

5  1 

28.73 

151 

34. 12 

.547860 

39 

34.84 

152 

29.94 

.338121 

48 

32.01 

!  53 

23.32 

.5207  10 

45 

30.72 

154 

33.85 

.22  i 4  53 

45 

36.  16 

155 

32.26 

.541  103 

42 

33.  18 

1 56 

33.60 

.591239 

39 

34.93 

157 

40.21 

.664820 

33 

4  0.53 

158 

39.  15 

.618512 

42 

33.84 

159 

29.25 

.607039 

42 

31.32 

160 

31.26 

.555556 

42 

35.31 

161 

35.05 

.436745 

48 

37.32 

162 

39. 18 

.562067 

45 

34.90 

163 

29. 13 

.582435 

42 

32.06 

164 

32.99 

.505515 

45 

36.27 

165 

31.21 

.535124 

45 

33.03 

166 

35.8  ! 

.350812 

48 

33.33 

167 

34.05 

.505615 

45 

35.95 

168 

36.05 

.437500 

45 

37.68 

169 

37.57 

.473205 

45 

39.57 

170 

39.42 

.541 103 

42 

39.55 

17! 

32.35 

.525934 

42 

34.94 

172 

45.  10 

.478395 

39 

44.61 

173 

33.69 

.398038 

45 

40.  1  1 

174 

38.37 

.437500 

45 

40.01 

175 

43.07 

.555556 

42 

44.20 

r? 


J 


'■ * 

rr 

y  . 


« 
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TABLE  IF,  Continuation  of  Table  1A,  Records  176  to  180 


REC 

Hs 

S  VP 

,7  WAVES 

RHs 

176 

47.38 

.437500 

45 

47.94 

177 

4  1.37 

.505615 

45 

43.63 

178 

42.43 

.473205 

45 

42.99 

179 

33.23 

.384379 

5! 

35.4  1 

180 

50.24 

.478395 

39 

52.29 

TABLE  2A,  Fifteen  Minut-  Samples  from  Hurricane 
Camille;  see  Table  LA,  Records  1  to  35. 


REC 

Hs 

Si  ' 

jPV.'AVES 

RH 

S 

1 

3.26 

.255521 

195 

3.69 

2 

3.  16 

.245224 

192 

3.66 

3 

3.  18 

.340775 

177 

3.65 

4 

3.30 

.227236 

189 

3.80 

5 

3.74 

.322250 

177 

4.24 

6 

3.88 

.295324 

183 

4.34 

7 

3.88 

.32851  1 

177 

4.48 

8 

3.73 

.302933 

177 

4.56 

9 

4.35 

.366052 

168 

4.35 

10 

4.46 

.339343 

165 

5.03 

I  1 

4.52 

.358446 

165 

5.  17 

12 

4.78 

.339343 

165 

5.39 

13 

5.94 

.465691 

144 

6.77 

14 

5.88 

.4  19877 

147 

6.85 

15 

5.82 

.496809 

144 

6.66 

15 

6.31 

.454332 

147 

7.  16 

17 

6.60 

.4014  13 

147 

7.25 

18 

7.13 

.483738 

133 

7.99 

19 

6.77 

.342586 

150 

7.97 

20 

6.98 

.466268 

14  1 

7.85 

21 

6.92 

,395957 

150 

7.77 

22 

8.  15 

.507017 

132 

9. 09 

23 

7.44 

.460693 

14  1 

8.30 

24 

8.53 

.353181 

14  1 

9.63 

25 

3.67 

.473977 

132 

9.67 

26 

8.72 

.46383! 

120 

9.60 

27 

9.  1  1 

.504510 

126 

10.22 

28 

9,70 

.450059 

132 

1C. 75 

29 

9.56 

.464354 

135 

10.44 

30 

10,41 

.437500 

132 

11.16 

31 

10.29 

.437500 

129 

11.19 

32 

9.21 

.496358 

132 

10.55 

33 

9.47 

.443698 

135 

10.67 

34 

9.40 

.376731 

150 

10.70 

35 

10.02 

.358201 

133 

1  1  .  1  I 

16 


Continuation  of  Table  2A,  Records  36  to  70 


TABLE  2B , 


REC 

H 

RH 

S 

SV.P 

#V.AVES 

S 

36 

10.48 

.462222 

132 

1  1 .34 

37 

10.53 

.397093 

132 

11.81 

38 

10.79 

.437500 

135 

1  1  .77 

39 

1  1.22 

.437500 

129 

12.35 

40 

1  1  .29 

.444298 

123 

12.  17 

41 

10.44 

.382653 

132 

1  1.79 

42 

1  !  .42 

.411033 

132 

12.70 

43 

1  1  .93 

.529796 

120 

12.99 

44 

11.65 

.518351 

120 

12.82 

45 

1  1.78 

.475524 

126 

13.20 

46 

12.53 

.378328 

123 

13.63 

47 

1  1.74 

.469545 

126 

12.93 

43 

1  1 .58 

.338620 

135 

13.00 

49 

12.60 

.470292 

123 

13.71 

50 

12.57 

.509161 

1  17 

14.26 

51 

13.41 

.423525 

123 

14.79 

52 

14.80 

.408234 

120 

15.61 

53 

14.27 

.443934 

129 

15.45 

54 

14.07 

.444466 

120 

15.75 

55 

14.31 

.423169 

120 

16.41 

56 

14.37 

.334852 

120 

16.09 

57 

15.64 

.492344 

1  14 

17 .06 

53 

16.49 

.437500 

1  1  1 

17.67 

59 

15.57 

.516459 

105 

17.  15 

50 

16.77 

.445232 

108 

18.  17 

61 

19.09 

.  .402393 

102 

20.45 

52 

13.62 

.421543 

108 

20.  15 

63 

18.38 

.405316 

1  1  1 

19.69 

64 

21.58 

.450205 

99 

23.27 

65 

20.90 

.460354 

105 

23.06 

66 

21.37 

.421078 

105 

23.52 

67 

22.  17 

.5 13936 

99 

23.87 

68 

24.51 

.539137 

93 

25.79 

69 

25.  10 

.501730 

96 

26.65 

70 

26.59 

.477809 

99 

28.94 

TABLE  2C,  Continuation  of  Table  2A  Records  71  to  90 


REC 

Hs 

SVP 

AV.AVES 

RH 

S 

71 

29.57 

.437500 

93 

31.35 

72 

27.96 

.568437 

90 

30.57 

73 

33.07 

.497799 

90 

34.55 

74 

26.91 

.505615 

90 

30.24 

75 

29.67 

.462968 

96 

32.45 

76 

31.89 

.447074 

87 

33.4  5 

77 

31.11 

.398038 

90 

33.55 

78 

32.65 

.566330 

31 

34.07 

79 

40.62 

.610624 

78 

39.07 

SO 

30.86 

.552130 

87 

33.34 

81 

34.41 

.494321 

96 

36.  13 

82 

30.99 

.513250 

90 

34.23 

83 

33.52 

.463761 

93 

35.78 

84 

35.29 

.437500 

93 

36.03 

85 

38.26 

.473205 

90 

39.56 

86 

33.70 

.427675 

87 

40.07 

87 

38.87 

.4 18271 

90 

40.06 

33 

44.94 

.464604 

90 

45.  1  1 

09 

41,56 

.455215 

93 

43.32 

90 

42.68 

.437500 

90 

44.66 

vv 
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TABLE  3A, 

Thirty  Minute 

Samples  from  Hurricane 

Camille, 

See 

Table  1A.  Records  1 

to  35. 

H 

RH 

REC 

S 

SWP 

WAVES 

S 

1 

3.  19 

.230454 

393 

3.69 

2 

3.24 

.273755 

359 

3.73 

3 

3.79 

.285526 

355 

4.30 

4 

3.33 

.319087 

354 

4.52 

5 

4.38 

.356139 

333 

4.94 

6 

4.58 

.349000 

330 

5.28 

7 

5.91 

.446099 

291 

6.8  1 

b 

6.02 

.465137 

294 

6.91 

9 

6.85 

.425720 

291 

7.63 

10 

6.88 

.393250 

294 

7.  19 

1  1 

7.53 

.440449 

285 

8.45 

12 

8.04 

.400217 

285 

8.99 

13 

8.69 

.462222 

264 

9.63 

14 

9.42 

.480634 

253 

10.99 

15 

10 .03 

.452805 

270 

10.80 

!6 

9.82 

.459222 

264 

10.88 

17 

9.43 

.384769 

291 

10.69 

13 

10.25 

.428005 

270 

1  1  .22 

19 

10.66 

.404893 

270 

1  1  .79 

20 

1  1  .25 

.430824 

255 

12.26 

2! 

1  1  .06 

.397093 

264 

12.25 

22 

1  1 .86 

.515201 

243 

12.90 

23 

12.32 

.4 16860 

252 

13.42 

24 

1  1.65 

,393531 

254 

12.97 

25 

12.57 

.475951 

243 

13.99 

26 

14.1! 

.401555 

246 

15.2  1 

27 

14.27 

.4442.6 

249 

15.60 

28 

14.63 

.404473 

240 

16.25 

29 

16.07 

.437500 

231 

17.37 

30 

16.22 

.485665 

213 

17.67 

31 

18.83 

.395692 

213 

20.30 

32 

20.01 

.464235 

213 

21.55 

33 

21.14 

.44  1495 

210 

23.29 

34 

23.34 

.5254  19 

192 

24.35 

35 

25.96 

.493513 

195 

27.32 

•  vv 


TABLE  3B,  Continuation  of  Table  3A.  Records  36  to  45. 


PEC 

Hs 

SV.P 

"V.'AVES 

RH 

s 

36 

29.08 

.508389 

183 

30.96 

37 

30.  1  1 

.484983 

183 

32.47 

38 

31.08 

.442009 

186 

32.95 

39 

31.55 

.474375 

174 

33.3  ! 

40 

35.53 

.584579 

165 

35.75 

4  1 

32.75 

.487474 

189 

35.  19 

4? 

34.66 

.433009 

189 

36.31 

43 

38.4  5 

.451533 

177 

39.81 

44 

41  .95 

.442558 

180 

43. 19 

4  5 

42.29 

.451081 

183 

o 

o 

'C 

20 


A  major  change  in  the  way  that  the  data  were  processed  in  this  study 
was  to  regroup  the  successive  waves  into  non-overlapping  sets  of  200  waves 
each  no  matter  how  long  in  time  the  sample  lasted.  Reasons  for  doing  this 
will  become  evident  as  our  study  progresses. 

One  result  of  this  re-arrangement  is  shown  in  Fig.  4.  The  horizontal 
axis  shows  time  in  minutes  (and  hours)  during  the  passage  of  the  waves. 

The  left  axes  show  wave  height  in  feet  and  meters  and  apply  to  the  circled 
dots.  The  right  axis  shows  the  average  of  the  "periods"  cf  the  waves  in  the  200 
wave  sample  which  is  shown  by  a  "square"  on  the  figure.  The  circled  dots 
show  the  highest  wave  in  each  200  wave  sample.  The  various  numbers  by  each 
point  are  explained  on  the  figure. 

Similar  data  are  also  shown  in  tabular  form  as  in  Table  4.  There 
were  55  two  hundred  wave  samples  as  shown  by  the  first  column.  The  time  it 
took  the  200  waves  to  pass  in  minutes  is  shown  in  the  second  column  from 
which  the  average  period  can  be  found.  The  third  column  shows  the  height 
of  the  highest  wave  and  the  last  column  shows  the  period  of  that  highest 
wave.  The  highest  wave  in  the  entire  sequence  is  in  record  48  and  is  72.23 
feet  from  crest  to  trough.  The  overall  trend  for  the  highest  wave  is  upward, 
but  the  variability  from  one  record  to  the  next  is  considerably  larger  than 
the  variability  shown  in  Figure  3. 

These  55  samples  will  be  studied  in  two  different  ways  after  theoretical 
results  of  various  authors  are  described. 

Just  prior  to  providing  a  summary,  some  tentative  conclusions,  and  a 
discussion  of  the  implications  of  our  results  for  wave  forecasting,  wave 
modeling,  and  the  study  of  the  motions  of  ships  in  waves,  the  work  of 
Spring  (1978)  will  be  reinterpreted  in  view  of  both  the  theoretical  and 
practical  problems  that  arise  in  the  study  of  ocean  wave  time  histories. 


MM A  *  f FT) 

TMAX (SEC^ 

i 

IS. 73 

4  .  B  l 

5.1b 

? 

•S.BP 

4 . 4  A 

S.bH 

3 

lb.  1M 

4  ’.  6  1 

4,93 

n 

1  b  .  b  3 

S'.  33 

B.9S 

s 

U.«5 

S*.  b9 

4 , 7  B 

fe 

1  b  .  7p 

S'.  B4 

4,b3 

7 

>b.bS 

S',B9 

4.3P 

H 

1  7.9  1 

b.P(7 

5.73 

0 

1  7.99 

b'  44 

7.71 

1  w 

U.S7 

7  C  b  7 

S.93 

1 1 

Pm  .  1  4 

9.  4  X 

9. '19 

1  p 

1  o  .  b  3 

*Oi 

b  ,  7  7 

1  3 

P's  .  P  7 

1  m p  7 

7.1b 

1  l 

Pm.P9 

1  3.37 

b.SH 

1  s 

Pm.Ph 

P‘.  1  q 

b.bP 

1  b 

Pm. 57 

i  r.  a  i 

l  <A  .  PM 

i  r 

Pm. 9B 

1  1  '.SB 

*.37 

1  B 

1  4  7  f» 

8.19 

19 

2m. 37 

1  4,  b  S 

B  ,7b 

3? 

Px.*9 

1  3.TA 

A.  7b 

PI 

P 7  .  IS 

1  S  .  3  4 

9  ,  bp 

3? 

Pi  .Ah 

1  s.pp 

B  ,  b  1 

3  1 

Pi  .no 

i  a '.  4.  4 

B.bS 

3 '4 

P-3.U 

1b. 9b 

B,  1  o 

p5 

Pn.bi 

1  4.39 

S.BH 

Pb 

Pm. 8  7 

1  b  '.  b  S 

9.PP 

37 

P  3 . 4  4 

1  7  '.  4  3 

B.S7 

P« 

Pi  .99 

1  S  .  b  f-1 

9.9fl 

?9 

P  3 . 4  S 

i  b.ns 

1  '1 . 4  1 

XH 

PX.tP 

is‘.93 

B,9b 

Pp.^W 

1  7.97 

B.Pb 

3P 

P/t.Sb 

1  4  3 

9.S5 

11 

Px.M 

P4.11S 

9.37 

34 

Px.  1b 

1B.3S 

b.XS 

«s 

2x.?w 

Pb.  IS 

7,59 

xs 

P/I  .  Btl 

Pt>8 

1  7,ap 

3? 

P/J  ,  pi  J 

??.  1° 

B.P5 

<« 

P/I.^b 

P  S  .  b  9 

1^.61 

34 

P'1. 34 

p  i‘.  4  r 

7,b9 

ct  r» 

PS.  31 

Pb'.n? 

H.'IS 

'4  1 

Pfc.99 

Pb.  1  ^ 

B.bP 

a? 

Ps.bP 

ps'.bb 

1  P.b7 

«  1 

?b,9B 

39.v1b 

B.p« 

a  a 

?A.  3 4 

37  .HQ 

11.44 

45 

PS.  74 

X  X '.  b  P 

IP. 7,i 

'lb 

'  i  .Pi 

Xb  .  7b 

9.72 

'4  7 

.1m.  API 

77. 1« 

9.  IS 

4b 

3p,«7 

7  ?  .  ?  7 

IP. 45 

a  9 

3  3  .  P  7 

4  b  .  4  b 

9.99 

S* 

3x.!l9 

bS‘.  4  1 

1  P  .  1  4 

SI 

3s,  3 'P 

ST.  94 

1  1  .  H 

SP 

3i  .33 

4  S  .  7  7 

IP. 52 

S3 

3i  .43 

SP.bb 

1  P  .  4  1 

S« 

3X.3B 

bb’  47 

9,4b 

ss 

3*. M 

7(4  .  XS 

11.84 

TABLE  4  Durations  in  Minutes,  Maximum  Wave  Height  in  200  Wave  Sample,  and 
Period  of  the  Maximum  Wave  for  55  Samples  Hurricane  Camille. 

23 


STUDIES  OF  RANDOM  WAVES  AND  OF  RANDOM  WAVE  SPECTRA 


Introduction.  Although,  this  investigation  is  concerned  with  the 
description  of  the  crest  to  trough  wave  height  pdf,  whether  or  not  equations 
(1),  (2)  and  (3)  describe  the  wave  heights,  and  if  not  how  can  a  better  model 
be  found,  there  is  a  connection  to  the  problems  associated  with  spectral 
estimation,  because  the  alternative  models  to  be  described  involve  higher 
moments  of  the  spectra  in  addition  to  mQ  (or  Eq) .  If  the  spectra  are  not  well 
known  (and,  for  some  applications,  well  predicted),  the  use  of  more  sophisti¬ 
cated  models  may  not  be  warranted.  For  this  reason,  some  recent  results  in 
the  area  of  spectral  estimation  are  reviewed  in  this  section. 

Review  of  Concepts.  Waves  that  have  been  generated  on  the  ocean  by  the 
wind  are  random  in  both  space  and  time.  Nevertheless,  their  behavior  must 
satisfy  the  hydrodynamic  equations.  The  model  of  waves  as  a  short  crested 
stationary  (probabilistically)  Gaussian  process  made  up  of  the  superposition 
of  sinusoids  is  an  approximation  to  reality  since  waves  are  in  fact  non-stationary 
(i.e.  evolutive)  and  non-Gaussian.  If  they  were  not,  there  would  be  no  need 
to  measure  them  or  predict  them. 

The  basic  fundamental  concept,  which  Tukey  (1949)  called  "Power  Spectral 
Analysis"  of  a  sample  from  a  Gaussian  time  history,  as  an  extention  of  the 
earlier  seminal  work  of  Rice  (1944),  and  which  Blackman  and  Tukey  (1958) 
applified  upon  in  a  second  key  paper  is  that  time  (or  space)  series  of  data, 
which  are  highly  correlated  in  time  and,  or,  space  (hence  the  hydrodynamics 
for  waves),  can  be  samp  led  and  analysed  to  obtain  an  estimate  of  the  spectrum 
that  resolves  the  variance  of  the  random  process  into  frequency  and/or 
wavenumber  bands.  In  the  spectral  domain,  the  complicated  time  and  space 
interdependence  in  physical  space  (i.e.  x,  y,  z,  t)  becomes  totally  uncorrelated 
in  frequency,  or  wave  number  space,  for  spectra  and  understandable  in  terms 
of  cross  spectra  for  physical  or  hydrodynamic a 1  effects  (as  in  bow  submergence 
and  slamming) . 

This  basic  concept  had  to  be  interpreted  carefully  in  their  original 
work,  but  FFT  analysis  methods  (Cooley  and  Tukey  (1965))  simplified  the 


problem  clean  up.  The  fundamental  concept  of  spectral  estimation  as  a  quotation 
from  Blackman  and  Tukey  (1949)  is  that  'Clearly,  for  any  specific  value  of  Tfi  (on 
number  of  pieces  of  record)  we  can  increase  frequency  resolution  (or  stability) 
only  by  sacrificing  stability  (or  frequency  resolution)"  (pg.  93).  They  also 
state:  "The  only  major  difficulty  in  their  practical  application  (i.e.  spectral 
estimation)  is  the  extensiveness  of  the  data  required  in  highly  precise  estimates. 
Their  requirement  is  an  inherent  irrevocable  characteristic  of  such  random 
processes",  (pg .  2) 

Some  doubts  were  expressed  about  this  fundamental  concept  at  a  recent 
conference  (Baer  (1980)).  Moreover,  a  recent  special  issue  of  Proceedings  of 
the  IEEE,  with  an  historical  perspective  by  Robinson  (1982)  has  papers  that 
describe  all  sorts  of  ways  to  try  to  avoid  the  problem  of  sampling  variability. 

It  is,  and  always  will  be,  unavoidable! 

In  the  time  and  space  domain,  which  is  the  real  world,  as  opposed  to  the 
theoretical  world  of  frequencies  and  wave  numbers,  because  of  the  temporal  and 
spatial  dependence  of  the  data,  analysis  of  the  probability  structure  of  the 
waves  becomes  ever  so  much  more  complicated.  Ochi  (1982)  has  reviewed  the 
present  state  of  the  subject,  with  references  through  1981,  including  aspects 
of  extreme  value  theory  that  are  (1)  theoretically  based  on  the  Rayleigh  pdf, 

(2)  on  approximate  methods  and  (3)  based  on  a  sample  of  extreme  waves  as 
measured. 

Results  of  Donelan  and  Pierson.  Although  spectral  analysis  has  been  used  for 
quite  a  few  years,  there  are  ways  to  misinterpret,  and  to  make  incorrect 
inferences  from,  spectral  estimates.  Donelan  and  Pierson  (1983)  made  an 
experimental  check  of  the  sampling  variability  of  estimates  of  spectra  of 
wind  generated  gravity  waves  under  both  laboratory  (C.C.I.W.)  and  field 
conditions  (at  Lake  Ontario).  The  theory  checked  extremely  well.  Nonlinear 
effects  are  an  add-on  correction  as  far  as  the  basic  properties  of  estimated 
variance  spectra  are  concerned. 

The  variability  of  the  sixteen  spectra  estimated  from  sixteen  time 
histories  measured  in  the  wind  wave  flume  at  a  constant  fetch  for  a  constant 
wind  highlights  how  variable  spectral  estimates  can  be  and  that  neither  the 
significant  wave  height  calculated  from  equation  (3)  nor  the  location  of  the 
frequency  where  the  sample  spectrum  has  a  peak  is  well  known.  The  quotations 


from  Blackman  and  Tukey  (1949)  provide  a  constraint  that  is  unavoidable. 

The  data  processing  methods  that  were  used  to  determine  the  JONSWAP  five 
parameter  family  of  spectra  are  an  example  of  the  difficulties  and  contradictions 
that  can  arise  in  interpreting  spectra  as  discussed  by  Pierson  (1977)  and  by 
Donelan  and  Pierson  (1983),  the  latter  in  a  rather  restrained  way  (almost  by 
impl ication) . 

Implications  for  JONSWAP.  The  study  by  Gunther  (1981)  is  most  re¬ 
vealing  in  that  it  introduces  a  fundamental  dilemma  in  the  parametric  method 
for  forecasting  waves.  Three  spectra  with  the  five  JONSWAP  spectral  parameters 
chosen  to  represent  a  range  of  conditions  were  used  and  normalized  to  a 
dimensionless  form  by  f /Af  so  as  to  select  a  spectral  resolution.  The  expected 
value  of  the  normalized  variance  in  each  spectral  band  was  thus  known.  Given 
the  expected  value  for  each  spectral  band,  randomly  selected  values  from  a 
Chi  Square  distribution  with  30  degrees  of  freedom  were  used  to  generate  a 
succession  of  sample  spectra. 

As  shown  by  Pierson  and  Donelan  (1983),  if  the  entire  set  of  randomly 
generated  sample  spectra  had  been  averaged  frequency  band  by  frequency  band, 
the  result  would  have  been  very  close  to  the  input  spectra.  One  hundred 
samples,  if  averaged,  would  have  given  a  spectrum  with  3000  degrees  of  freedom 
within  ±  4%  of  the  input  spectrum  at  the  90%  confidence  level. 

This  was  not  done,  however,  each  sample  spectrum  was  separately  analysed 
to  recover  the  JONSWAP  parameters.  The  recovered  parameters  were  averaged 
and  their  standard  deviations  were  found.  The  output  spectra  based  on  the 
five  averaged  parameters  were  not  the  same  as  the  input  spectra.  (See  Gunther 
(1981)  and  Donelan  and  Pierson  (1983)).  This  result  exposes  a  fallacy  in  the 
definition  and  parameterization  of  the  JONSWAP  spectra  as  claimed  by  Pierson 
(1977).  It  also  produces  a  dilemma. 

Suppose  for  the  first  choice  of  the  dilemma  that  a  place  could  be  found 
somewhere  on  the  ocean  with  constant  offshore  winds  and  no  effect  of  depth, 
that  the  JONSWAP  parametric  model  (Hasselmann,  et  al.  (1976))  is  used  to 
predict  the  waves  as  a  function  of  fetch,  and  that  wave  records  of  standard 
duration  are  obtained  from  which  the  spectra  are  estimated.  (Spectra  are  not 
measured-some  authors  to  the  contrary.)  For  each  of  the  spectral  estimates, 
the  values  of  f  at  the  same  fetch  will  vary  from  one  spectrum  to  another 


and  the  value  of  y  (the  peak  enhancement  parameter)  will  scatter  considerably 
once  is  selected.  The  predicted  JONSWAP  spectra  will  not  be  verified  by 
the  individual  spectral  estimate*.  Under  the  assumed  conditions,  records,  say 
ten  or  twenty  times  longer  can  be  obtained.  If  these  are  fitted  by  the 
JONSWAP  method,  the  values  of  y  will  shrink  to  much  lower  values  and  the 
values  of  f  will  tend  toward  a  much  more  stable  value.  The  predicted  JONSWAP 
spectra  will  not  agree  with  the  spectral  estimates,  especially  in  shape. 

For  the  second  choice  of  the  dilemma,  use  all  of  the  above,  plus  the 
assumption  that  the  waves  that  are  generated  actually  do  have  the  JONSWAP 
spectra  predicted  by  the  parametric  model.  If  the  sample  spectra  are  used 
to  verify  the  parametric  model  predictions,  the  values  of  y  will  be  too  high, 
and  individual  values  of  f  will  still  scatter  in  an  unpredictable  way  as 
shown  by  Gunther's  simulation.  Under  these  assumptions,  spectra  estimated 
from  wave  records  ten  or  twenty  times  longer  than  usual  will  be  closer  to  the 
predicted  spectral  shapes. 

To  be  on  the  horns  of  a  dilemma  is  to  be  in  a  position  where  the  choice 
of  either  one  of  two  alternatives  is  equally  unfavorable.  One  choice  is  to 
claim  that  the  JONSWAP  parameters  are  a  valid  way  to  describe  fetch,  or 
duration,  limited  wave  spectra  The  parameters  will  be  a  function  of  the 
length  of  the  "stationary"  time  history,  and  sample  spectral  estimates  will 
not  verify  The  other  choue  ■  -  r  !.»im  that  the  parametric  method  predicts 
the  JONSWAP  spectral  shapes  <r-  >t.  Were  this  the  correct  situation,  the 
spectral  peaks  of  the  estimate.)  :>e<  •  ra  would  all  be  higher  than  those  of  the 
predicted  spectra  and  the  forecast  »  o!  t  not  verify.  Only  much  longer  records 
would  verify  the  method 

The  reader  will,  hopefully,  be  ibl>-  to  deduce  from  the  above  discussion 
that  the  first  alternative,  which  is  a  result  of  the  data  analysis  fallacy, 
corresponds  to  the  actual  situation  The  spectra  predicted  by  the  parametric 
model  are  not  predictable  spectra 

There  are  numerous  implications  of  the  above  results,  with  reference 
to  overshoot  phenomenon  and  the  verification  of  nonlinear  wave-wave  inter¬ 
action  theories.  There  is  the  strong  possibility  that, even  if  non-linear 
wave  interactions  are  possible,  their  effect  has  been  greatly  overestimated 
when  found  from  estimated  wave  spectra. 


The  Surface  Contour  Radar.  The  surface  contour  radar  (SCR)  has  been  used 

by  Walsh,  et  al .  (1983)  to  measure  waves  as  a  function  of  x,  y  (at  rapidly 

varying  t)  for  a  swath  beneath  an  aircraft  flight  line.  Stereo  photographs 

give  spectra  with  an  ambiguity  of  the  form,  S(k  ,  k  )  =  S(-k  ,  -k  ),  but  a 

x  y  x  y 

way  has  been  found  to  eliminate  the  false  part  of  the  spectrum  based  on  an 
effect  produced  by  the  moving  aircraft.  For  linear  waves  the  spectra  can 
be  transformed  to  the  form  S(oi,  9) .  Advantages  of  the  SCR  are  high  wave 
number  resolution  in  one  direction  and  a  large  number  of  degrees  of  freedom 
for  an  estimate  of  S(io,8)  AcoA9  for  typical  resolutions. 

A  study  was  made  of  the  growth  of  fetch  limited  waves  for  offshore  winds 
of  15  m/s  and  22  m/s  measured  at  400  m.  One  fetch  was  about  200  km  for  a 
flight  exactly  downwind  that  began  about  27  km  northward  along  the  coast  from 
Cape  May  New  Jersey.  The  other  flight  was  offshore  from  the  coast  of 
Maryland  about  100  km  south  of  Cape  Henlopen. 

Delaware  Bay  produced  some  most  interesting  effects.  The  distance  from 
Cape  May  to  Cape  Henlopen  is  about  18  km,  and  the  fetch  over  Delaware  Bay  is 
perhaps  twice  this  distance.  The  line  connecting  the  two  capes  can  be 
idealized  to  a  wave  generator  injecting  waves  with  a  spectrum  of  the  form, 

S(w, 9).  The  angular  spread  for  a  particular  frequency  can  be  described  in 
many  ways  with  perhaps  the  normalized  form  K(cos(8/2))  to  the  power,  P(u>), 
being  the  simplest. 

The  vector  wave  number  spectrum  was  estimated  (in  a  statistical  sense) 
at  various  places  along  the  fetch  as  the  plane  flew  downwind  off  the  New 
Jersey  Coast.  The  growth  of  the  waves  that  would  be  expected  in  the  absence 
of  Delaware  Bay  is  found  in  the  spectrum.  A  distinct  second  spectral  peak 
is  also  found  in  each  spectrum  that  can  be  traced  back  to  the  line  joining 
the  two  capes.  The  waves  emanating  from  the  bay  continue  to  grow,  even 
though  moving  at  an  angle  to  the  wind.  A  modification  of  spectral  growth 
theories  so  as  to  include  wind  direction  and  so  as  to  describe  a  directional 
wave  spectrum  would  account  for  these  observations  since  zero  fetch  would 
be  somewhere  inside  the  bay  and  the  rate  of  exponential  growth  for  a  par¬ 
ticular  frequency  would  begin  at  a  distance  related  to  this  zero  fetch 
baseline  instead  of  the  New  Jersey  Coastline. 


Whether  or  not  the  presence  of  waves  from  Delaware  Bay  inhibited  the 


growth  of  the  waves  farther  north  generated  by  the  offshore  wind  was  also 
studied.  The  results  were  not  conclusive,  but  there  was  not  any  convincing 
data  to  show  any  inhibitory  effects. 

Second  Order  Effects  on  Wave  Spectra.  The  work  of  Masuda,  et  al .  (1979), 
Kuo,  et  al .  (1979a,  1979b)  and  Mitsuyasu,  et  al.  (1979)  found  that  (1)  the 
main  part  of  the  wave  spectrum  could  be  treated  as  free  waves  with  minor 
corrections  to  their  phase  speeds  as  a  result  of  nonlinear  interactions, 

(2)  that  wind  wave  flumes  require  consideration  of  a  wind  drift  current  that 
is  relatively  more  important  in  the  laboratory  than  in  the  ocean  and  that  (3) 
the  only  nonlinear  effects  on  the  spectrum  were  bound  harmonics  at  frequencies 
past  1.6  times  the  spectral  peak. 

This  study  was  an  extention  of  the  work  of  Tick  (1959),  who  solved  the 
problem  for  long  crested  deep  water  waves  with  a  free  surface  described  by, 
say,  n(x,t) .  Their  extension  was  for  a  free  surface  given  by  n(x,y,t),  and 
assumed  directional  spreads  were  used.  The  results  were,  as  would  be  expected, 
an  improvement  if  compared  to  Tick  because  the  waves  were  short  crested. 

For  ocean  wind  waves,  the  correction  to  the  variance  spectrum  was  well 
past  the  spectral  peak.  The  correction  was  more  than  two  orders  of  magnitude 
down  from  the  peak  of  the  linear  part  of  the  spectrum.  Other  corrections 
were  more  than  three  orders  of  magnitude  down.  Phase  speed  changes  were 
minimal  increases  as  a  function  of  frequency.  The  bound  harmonics  at  second 
order  traveled  at  twice  the  speed  of  the  fundamental. 

Variance  spectra  are  not  particularly  revealing  in  the  study  of  non¬ 
linear  random  waves.  There  is  no  reason  to  believe  that  the  correction 
to  the  phase  speed  is  simply  the  same  constant  everywhere  for  a  given 
frequency.  There  are  certain  assumptions  in  these  results  concerned  with 
triple  products  of  various  terms  that  need  further  checking  by  other  means. 

They  may  not  be  correct. 

Generation  of  Monochromatic  Waves  by  Wind.  Mitsuyasu  and  Honda  (1982) 
have  shown  spectra  of  wind  waves  plus  monochromatic  waves  in  which  the  spikes 
for  the  monochromatic  wave  were  three  orders  of  magnitude  above  the  wind 
wave  spectra.  It  was  possible  to  compute  the  exponential  growth  rate  of  the 
monochromatic  wave  as  a  function  of  fetch  in  the  presence  of  wind  generated 


waves . 


With  the  wind  wave  spectra  almost  completely  eliminated  by  the  addition 
of  a  surfactant,  it  was  also  shown  that  monochromatic  waves  grew  at  essentially 
the  same  rate  after  a  correction  for  the  change  in  zq  and  u*  was  made  as  a 
result  of  the  surfactant. 

One  can  possibly  infer  that  a  spectral  component  in  a  wind  sea  grows 
independently  of  the  other  components  and  does  not  depend  on  nonlinear 
interactions.  The  results  of  Mitsuyasu  and  Honda  (1982)  do  not  fully  settle 
the  question  since  the  highest  frequency  used  was  somewhat  lower  than  the 
frequencies  in  the  wind  wave  spectra. 

Nonlinear  results  as  applied  to  the  study  of  spectra  do  not  seem  to  be 
producing  dramatic  results  in  these  examples.  The  application  of  nonlinear 
theories  to  the  study  of  waves  in  terms  of  wave  height  distributions  and  other 
wave  statistics  has  much  still  to  be  done. 


NEW  NONLINEAR  WAVE  THEORIES 


Introduction.  Many  new  departures  on  nonlinear  wave  theories  have  been 
initiated  over  the  past  few  years.  Their  connection  to  random  waves  on  the 
sea  surface  is  not  yet  clear.  There  may  never  be  any  connection  for  some  of 
the  results  that  have  been  obtained.  Before  plunging  into  a  summary  and 
critique  of  these  results,  it  appears  to  be  worthwhile  to  look  at  Stokes' 
waves  at  third  order  in  a  somewhat  different  way. 


2  2 

The  term  (a  k  )/2  in  equation  (6  )  only  arises  when  the  third  order 
terms  in  equations  (4)  and  (5)  are  found.  The  correction  to  the  phase  speed 
only  arises  at  third  order.  Since  kC  =  w,  (6)  can  be  changed  to  (7). 

a)  =  (gk)^  fl  +  (a2  k2)/2  (7) 

If  the  frequency  of  the  wave  cannot  change,  then  the  amplitude  of  the  wave  is 
determined  by  the  wave  number  as  in  (8), 

a  =  1  (k— — j-  -  1)1  **  (8) 

k  1  (gk)2 


For  a  constant  frequency  of  2tt/10,  or  a  period  of  10  seconds,  Figure  5 
shows  the  wave  profile  as  a  function  of  time  at  a  fixed  point  along  with  the 
values  of  a,  k,  L,  the  crest  height  and  the  trough  depression. 
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The  wave  profile  as  a  function  of  x  for  a  frequency  of  2tr/10  is  shown 
in  Fig.  6  along  with  all  of  the  other  appropriate  values.  The  highest  wave 
in  both  figures  corresponds  to  ak  =  0.5.  As  the  height  increases,  the  crest 
to  trough  assymetry  increases  and  the  wavelength  increases  from  157  meters 
to  196  meters,  an  increase  of  25%. 

There  are  several  interesting  ways  to  study  equations  (6) ,  (7)  and  (8) . 
If  the  amplitude  of  the  first  harmonic  is  increased  and  cj  is  kept  constant, 
the  wavelength  and  the  phase  speed  must  both  increase.  For  a  period  of  10 
seconds,  the  phase  speed  numerically  is  simply  one  tenth  of  the  wavelength. 
The  amplitudes  of  the  first,  second  and  third  harmonics  as  a  function  of 
wavelength  (and  phase  speed)  are  shown  in  Figure  7.  The  amplitude  of  the 
first  harmonic  increases  rapidly  as  a  function  of  wavelength  at  first  and 
then  more  slowly  whereas  the  third  harmonic  is  at  first  negligable  and  then 
increases  rapidly  as  the  value  of  ak  reaches  about  0.5  where  the  curves  stop 
abruptly. 


Figure  8  shows  graphs  of  various  sums  of  terms  that  arise  when  equation 
(4)  is  evaluated  all  as  a  function  of  the  value  of  a  for  a  period  of  10 
seconds.  The  curve,  a  =  a,  is  of  course  a  line  with  a  45°  slope.  The  curve, 
h  =  2a,  would  be  the  height  of  a  linear  wave.  The  curve,  a^,  is  the  absolute 
value  of  the  depth  of  the  trough,  i.e.  equation  (4)  for  k(x  -Ct)  equal  to  -it. 


Similarly,  a c  is  for  an  arguement  of  zero,  and  h 


a^  -  a^.  is  the  crest  to 


trough  height.  The  difference  between  h  and  2a  is  the  result  of  the  third 


harmonic,  and  the  values  of  (6),  (7)  and  (8)  all  depend  on  this  being  a 


solution  to  third  order.  The  required  wavelength  is  shown  on  an  auxiliary 
scale  as  the  value  of  a  is  varied. 


Finally,  the  phase  speed  is  graphed  as  a  function  of  wavelength  for 
a  k  =  0  and  a  k  =  0.5  in  Figure  9.  The  area  between  the  two  curves  would 
show  all  of  the  possible  values  for  phase  speed  and  wavelength  for  Stokes 
waves  if  a  third  order  pertubation  expansion  were  good  enough  up  to  ak  =0.5. 
Lines  of  constant  period  as  labled  radiate  from  the  origin. 


The  amplitude  of  the  waves  for  the  lower  curve  is  zero.  The  labled 
points  on  the  upper  curve  show  the  crest  to  trough  heights  of  (4)  for 
a  kg  =  0.5  at  the  phase  speed  and  wavelength  of  that  point.  There  is  a 
substantial  range  of  wavelengths  possible  for  a  fixed  phase  speed  and  a 
considereable  range  of  phase  speed  for  a  fixed  wavelength.  Quite  clearly. 
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k=0.04  L=  157.07 

C=  15.707  T=  10 

o  =  1.95 

h  =  3.91 

k  =0.038  L=  165.35  C=  16.535  T=  10 
o  =  6.35  h=  12.97 


k  =0.036  L=  174.53  C  =  17.453  T=I0 
a  =  9.40  h=  19.57 


k  =  0.034  L  =  184.80  C  =  I8.48  T=  10 
0  =  12.33  h  =  26.29 

20.60 


k  =  0.032  L=  196.35  C=I9.635  T=  10 
0  =  15.40  h  =  33.60 


FIG.  6  Non-linear  Forms  of  the  Stoke' s  Wave.  (X  in  Meters.) 
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FIGURE  7 
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8  Linear  Wave  Amplitude,  Crest  and  Trough  Values  and  Total 
Wave  Height  for  a  10  Second  Wave  as  a  Function  of  a. 
Dashed  Curve  Shows  Wave  Length  in  Meters  on  Right  Scale. 
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for  this  third  order  solution  as  a  in  (4)  is  increased,  if  k  (or  L)  is  kept 
constant  C  must  increase  and  hence  T  must  increase,  and  if  kC  is  kept  constant, 
T  remains  constant  and  k  must  decrease.  When  waves  vary  in  height  in  a  random 
sea,  is  there  a  corresponding  effect  from  wave  to  wave?  As  waves  in  a  random 
sea  grow  in  height,  do  they  lengthen  and  speed  up?  Or,  as  more  recent  results 
to  be  described  later,  suggest,  can  even  stranger  things  happen  when  random 
waves  pass  through  some  critical  value  of  ak  locally  before  they  break. 

More  Realistic  Periodic  Models.  A  third  order  perturbation  solution  for 
periodic  gravity  waves  following  the  techniques  of  Stokes  might  be  possible 
of  generalization  so  as  to  describe  random  waves  more  realistically.  Some 
interesting  results  on  the  non-Gaussian  properties  of  waves  have  been  obtained 
as  in  Huang,  et  al.  (1983).  M.  S.  Longuet -Higgins,  and  others  working  with 
him,  have  shown  enough  descrepancies  between  the  above  simple  analysis  and 
what  actually  happens  as  periodic  waves  increase  in  height  to  indicate  that 
even  partial  success  along  these  lines  will  be  difficult. 

For  the  first  example,  the  profile  of  a  high  wave  has  the  property  that 
3  n/3x  is  positive  along  the  entire  profile  except  close  to  the  crest  where 
an  angle  near  120°  is  formed  (See  Longuet -Higgins  (1979a)  his  Figure  5,  for 
example) .  The  motion  of  the  water  near  the  120°  corner  has  been  investigated 
in  detail  as  in  Longuet -Higgins  (1980)  .  The  above  1979  reference  also  shows 
that,  with  qualifications,  the  Stokes'  drift  for  the  higher  waves  is  about 
three  times  greater  near  the  surface  than  would  be  calculated  from  Stokes' 
original  equation.  At  the  surface,  for  each  complete  cycle  of  the  highest 
wave,  a  water  particle  advances  horizontally  by  an  amount  equal  to  0.38  times 
the  wavelength.  Its  average  speed  of  advance  is  0.27  times  the  phase  speed. 
Thus,  the  Stokes'  second  order  formula  is  not  applicable  for  very  steep  waves 
over  the  whole  range  of  depths.  Some  of  these  results  are  based  in  part  on 
previous  results  of  Yamada  (1957)  for  deep  water. 

A  very  simple  solution  for  the  almost  highest  wave,  which  does  not 
involve  an  infinite  series  and  problems  of  convergence  has  been  found 
(Longuet-Higgins  (1979b)).  With  x  positive  down,  y  positive  to  the  right 
and  with 

z  =  x  +  i  y 


X  =  +  i 

* 

io=u  +  iv=(d  x/dz) 
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(9) 

(10) 
(ID 


(where  *  is  the  complex  conjugate)  then  equation  (12)  provides  a  simple,  but 
not  exact,  solution  for  the  almost  highest  wave. 


x  +  i  y 


q  +  yi  x 
(6  +  i  x) 


1/3 


(12) 


For  ip  =  0,  x  =  $  and  (12)  gives  the  wave  profile  parametrically.  In 
(12),  a  =  1.6876,  6  =  4.8065  and  y  =  1.3104.  This  approximation  differs  from 
the  exact  solution  by  at  most  5%  and  over  most  of  the  profile  by  less  than 
1%  in  satisfying  the  pressure  condition  at  the  free  surface. 

Another  very  interesting  result  described  by  Longuet -Higgins  (1975,1978a) 

2 

is  that  the  higher  periodic  waves  are  not  the  fastest.  If  C  /gk  is  computed 

as  a  function  of  ak,  with  k  fixed  (or  L) ,  the  maximum  occurs  at  about  ak  =  0.436 

2 

and  equals  about  1.19.  As  a  increases  beyond  this  value,  C  /gk  decreases 
until  the  highest  and  steepest  wave  is  reached.  The  lines  of  constant  period 
for  a  given  L  in  Fig.  9  would  require  corrections  to  account  for  this  property 
of  periodic  waves.  According  to  Longuet -Higgins  (1978b). 


"  -  It  was  generally  thought  that  the  phase- speed  C  of  a  steep, 

progressive  gravity  wave  on  deep  water  always  increased  with  the  wave 
steepness,  at  given  wavelength.  However,  further  calculations  (Longuet- 
Higgins  (1975))  have  shown  that  the  wave  speed,  considered  as  a  function 
of  the  crest-to-trough  height  2a,  increases  to  a  maximum  within  the  range 
of  possible  heights,  and  then  diminishes;  the  highest  wave  is  not 
actually  the  fastest.  The  reason  for  this  behaviour  is  associated  with 
the  fact  that  the  profile  of  the  highest  wave,  having  a  very  sharp  curva¬ 
ture  at  the  crest,  intersects  the  profile  of  a  wave  which  is  not  quite 
so  high.  Hence  the  higher  wave  lies  below  it  over  most  of  the  wavelength 

-  .  Its  potential  energy  is  therefore  less,  and  its  kinetic  energy 

also. " 

Longuet -Higgins  and  Cokelet  (1976)  have  studied  the  deformation  of  steep 
surface  waves  on  water  by  means  of  numerical  computations.  Longuet -Higgins 
(1978b)  summarized  the  method  that  was  used  as  follows  (equation  and  figure 
numbers  have  been  changed). 

"Observations  suggest  that  steep  progressive  waves  have  a  strong 
tendency  to  become  steeper  on  their  forward  face  and  to  topple  forwards, 
producing  either  "spilling"  or  "plunging"  breakers.  The  development  of 
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the  wave  profile,  at  least  up  to  the  point  of  impact  of  the  jet  on  the 
forward  face,  has  been  recently  followed  by  a  new  method  described  by 
Longuet -Higgins  and  Cokelet  (1976) .  Although  restricted  in  the  first 
place  to  irrotational  waves  on  deep  water,  the  method  appears  to  be 
capable  of  extension  to  waves  in  the  presence  of  a  surface  shear  current 
and  to  water  of  infinite  depth. 

"The  motion  is  assumed  periodic  in  the  horizontal  coordinate  x  with 
length  scale  L,  though  not  periodic  in  the  time  t.  By  a  simple  substi¬ 
tution  £  =  exp  [ik(x+iy)]  the  free  surface  is  transformed  into  a  simple 
closed  curve  C  in  the  £ -plane,  and  all  points  at  infinite  depth  are 
transformed  into  a  single  point  in  the  interior.  By  a  suitable  choice  of 
coordinates  the  complex  velocity  potential  x  =  4>  +ix  becomes  regular  and 
analytic  everywhere  inside  C,  at  all  times.  The  velocity  field  in  the 
interior  of  the  fluid  depends  only  on  the  values  of  $  at  the  surface,  and 
the  computation  is  carried  forward  by  following  the  values  of  x,  y  and  (J> 
corresponding  to  marked  particles.  At  each  time-step  we  compute 


II 

3  Is 

rx 

(13) 

dy  _ 

dt  ' 

4> 

y 

(14) 

d<j> 

dt 

j  (Vcp) 2  -  gy  -  p/p 

(15) 

where  p  is  the  surface  pressure.  This  enables  the  value  of  <J>  on  C(t+dt) 
to  be  found,  and  hence  the  tangential  derivative  3  4>/3 s .  To  find  the 
normal  derivative  3<|>/3n  is  equivalent  to  solving  a  Dirichlet  problem: 

How  to  determine  the  normal  derivative  of  a  function  <p,  given  on  the 
boundary  and  satisfying 

V2<j>  =  0  (16) 

everywhere  in  the  interior. 

"  Longuet-Higgins  and  Cokelet  (1976)  find  the  solution  by  means  of 
an  integral  equation,  which  is  solved  numerically,  at  each  time  step.  The 
method  involves  the  neglect  of  viscosity,  compressibility  and  surface 
tension,  but  does  not  assume  the  particle  acceleration  to  be  small  compared 
to  g. 

40 


V;-: 


.V 


a 


f 


Ik!  U: 


"In  their  first  paper  (1976)  some  examples  were  given  of  the 
development  of  the  free  surface  from  a  symmetric  wave,  when  the  energy 
level  was  further  raised  by  the  smooth  application  of  a  surface 
pressure  distribution,  over  a  short  initial  period.  After  removing 
the  pressure,  the  free  surface  curled  over  and  plunged  forward,  in 
the  way  commonly  observed  (see  Figure  10). 

"In  another  set  of  numerical  experiments  Cokelet  (1979b)  has  begun 
with  a  free  wave  of  exactly  sinusoidal  form,  but  moderate  or  large 
amplitude  and,  without  applying  any  pressure  at  the  surface,  has 
followed  the  subsequent  development  of  the  wave  profile.  Just  as  in  the 
previous  case  the  wave  forms  a  high-velocity  jet  and  plunges  forwards 
(see  Cokelet  (1977b),  Figure  10).  This  happens  even  when  the  initial 
energy  is  less  by  a  factor  2/3  than  the  maximum  energy  for  a  steady, 
nonsinusoidal  wave  of  the  same  wavelength  — ." 

" —  The  momentum  in  the  jet  can  grow  to  be  a  substantial  portion 
of  the  total  momentum  of  the  wave  train. 

"To  follow  the  jet  beyond  the  instant  where  it  impinges  on  the 
forward  face  of  the  wave  requires  a  different  type  of  analysis,  which 
must  be  capable  of  modelling  the  turbulent  shear  flow  and  the  entrain¬ 
ment  of  air  and  of  momentum  from  the  laminar  part  of  the  flow.  A  start 
in  this  direction  has  been  made  by  Longuet -Higgins  and  Turner  (1974) 

The  breaking  waves  in  deep  water  can  be  followed  numerically  past  the 
breaking  point.  Fig.  11  shows  a  copy,  as  the  wave  curls  over,  of  one  of  the 
Figures  in  Longuet -Higgins  and  Cokelet  (1976).  Descriptions  of  some  of  the 
extreme  waves  and  of  ship  damage  at  sea  frequently  include  descriptions  of 
events  closely  paralleling  this  kind  of  a  wave  form. 

How  Waves  Break.  In  1976,  Longuet -Higgins  and  Cokelet  (1976)  wrote  in 
the  abstract,  "plunging  breakers  are  beyond  the  reach  of  all  known  analytical 
approximations."  Longuet-Higgins  (1982)  then  proceeded  to  find  these 
analytical  solutions.  The  water  need  not  be  shallow  for  a  plunging  breaker 
to  happen,  but  probably  the  examples  that  many  have  seen  are  the  motion 
pictures  of  surfboard  riders  "in  the  tube"  where  the  breaking  wave  has 
curled  up  and  over  the  surfer  but  the  plunging  part  of  the  wave  has  not  yet 
fallen  to  the  water  in  front  of  the  crest. 

*  Our  references  do  not  give  1979a. 
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The  theory  used  goes  back  at  least  as  far  as  the  work  of  John  (1952,  1953) 
and  to  previous  work  by  Longuet-Higgins  (1976)  .  The  start  is  with  equations 
(9),  (10)  and  (11),  but  instead  of  expressing  x  directly  as  a  function  of  z 
and  t,  it  is  assumed  that  y  and  t  are  each  analytic  functions  of  a  third 
complex  variable  to  (used  differently  here  than  elsewhere  in  this  review)  and 
of  t.  Then  by  eliminating  to  between  y  and  z,  it  would  be  possible  to  express 
y  as  an  analytic  function  of  z,  except  at  a  singularity.  Many  kinds  of  flows 
were  studied,  with  the  result  that  cubic  flows  in  the  parameter  to  (pure 
imaginary)  were  the  simplest  case  for  which  the  free  surface  can  interact  itself. 
The  singularity  is  inside  the  "tube"  where  there  is  no  water  and  the  free 
surface  forms  the  inner  portion  of  the  breaking  wave.  The  plunging  part  is 
more  or  less  in  free  fall.  Excellent  agreement  is  found  between  the  solution 
and  the  experiments  of  Vinje  and  Brevig  (1981)  as  shown  in  Figure  12. 

The  above  cited  references  are  all  based  on  the  equations  that  describe 
irrotational  flows.  Most  of  them  were  concerned  with  periodic  waves.  Unfor¬ 
tunately,  the  potential  function  needed  to  describe  waves  extends  over  an 
infinite  half  space.  It  is  thus  difficult  to  describe  steep  waves  near  their 
crests  and  breaking  and  to  treat  irregular  waves  with  varying  heights  and 
periods.  There  is,  however,  the  possibility  of  piecing  together  parts  of 
some  of  the  solutions  that  have  been  found  in  these  and  other  papers  of  the 
same  author  so  as  to  represent  storm  seas  more  realistically. 


Nonlinear  Models  for  the  Free  Surface  Movement  of  Irregular  Waves.  Because 


of  the  difficulties  involved  with  irregular  waves,  other  ways  have  been 
tried  to  study  them.  One  way  is  to  avoid  using  the  basic  hydrodynamic  equations 
and  to  derive  approximate  equations  for  the  motion  of  the  free  surface.  These 
attempts  appear  to  have  begun  with  Zakkarov  (1966)  and  Zakkarov  and  Shabat  (1972) 
Extentions  have  been  made  by  Dysthe  (1979)  Janssen  (1983),  and  Hui  and  Hamilton 
(1979) .  A  substantial  review  of  these  methods  (among  others)  is  given  by  • 

Yuen  and  Lake  (1982)  . 


The  concept  involved  was  to  show  that  the  nonlinear  behavior  of  waves  at 
the  free  surface  could  be  approximated  by  various  nonlinear  partial  differential 
equations  such  as  the  Schrodinger  equation.  Exact  solutions  to  this  equation 
with  a  hyperbolic  secant  envelope  have  been  found  and  called  solitons. 

Some  extentions  involve  extending  the  equation  which  is  supposedly 


realistic  at  the  order  of  (ak)  by  an  added  term  to  improve  it  to  the  order 
4 

of  (ak)  by  Dysthe  (1979)  and  other  types  of  solutions  as  in  Hui  and 
Hamilton  (1979),  for  which  the  waves  travel  at  an  angie  to  the  envelope. 

These  solutions  have  many  interesting  features.  Some  can  be  repro¬ 
duced  experimentally.  Others  cannot.  The  envelope  propagates  without  change 
of  shape  at  close  to  the  group  velocity  of  the  carrier  wave.  These  solutions 
are  still  under  experimental  study,  but  since  solitons  do  not  break,  they  do 
not  appear  to  be  capable  of  describing  storm  seas. 

Nonlinear  Dispersive  Waves.  Still  another  approach  to  the  problem  is 
that  of  Witham  (1965,  1970)  with  further  contributions  by  Lighthill  (1967) 
and  Yuen  and  Lake  (1975)  and  a  review  by  Yuen  and  Lake  (1982) .  The  concept 
is  to  treat  a,  k  and  gj  (and  consequently  C  and  the  group  velocity)  in  equations 
patterned  after  equations  (4),  (5)  and  (6)  as  slowly  varying  functions  of  x 
and  t  with  the  idea  that  the  hydrodynamic  equations  will  be  more  or  less 
locally  satisfied.  The  additional  constraints  required  are  found  by  applying 
variational  principles  to  the  Lagrangian  of  the  system. 

For  long  crested  waves  on  deep  water,  following  Yuen  and  Lake  (1975),  the 
equations  that  result  are  given  by  equations  (17),  (18),  (19)  and  (20) 


r|(x,t)  =  a(x,t)  cos  0(x,t)  +  (a(x,t))  k(x,t)  cos  2  0(x,t) 

2 

+  |  (a(x,t))3  (k(x,t))2  cos  3  0(x,t)  (17) 

4>(x,t)  =  t'1  |exp  (k(x,  t)  z  j  sin  0(x,t) 

(  at(x,t)  io(x,t)  ax(x,t)  )  r 

+  — k7V  TT*  cos  9^x*t^  +  - - - 5 —  1-kz  cos  0(x,  t)  •  exp  (k(x,t)z 

1  K(X>Z}  fkfx.  t'tr  )  J  l 


co(x,t)  ax(x,t) 


(k(x,  t))‘ 


+  sin  2  0(x,  t)  •  exp  (k(x,t)z) 


where  0  =  -w(x,  t) 


0  =  k(x,  t) 


(20) 


These  equations  have  a  strong  resemblance  to  equations  (4),  (5)  and  (6) 
with  some  additional  correction  terms.  The  derivations  require  that  0(x,  t) 
be  a  rapidly  varying  function  of  space  and  time  compared  to  a(x,  t) ,  hence 
the  concept  of  two  timing  as  in  Witham  (1970)  .  As  Witham  (1965)  states, 

"For  nonlinear  problems,  it  is  assumed  that  exact  analytic  solutions 
corresponding  to  [long  crested  dispersive  waves]  are  out  of  the  question 

and  the  possibility  of  finding  approximate  solutions  -  is  explored." 

A  further  study  of  these  equations  will  be  deferred  to  the  following  section. 


STUDIES  OF  DETERMINISTIC  WAVE  TRANSIENTS 

Long  Crested  Linear  Dispersive  Gravity  Wave  Transients  in  Deep  Water. 
Transient  groups  of  waves  have  the  property  that  if  observed  at  a  fixed  point, 
the  water  is  at  first  calm,  then  it  oscillates  up  and  down  according  to  some 
pattern  and  then  becomes  calm  again.  If  observed  as  a  function  of  distance 
along  a  wave  tank,  for  example,  the  disturbance  and  be  seen  to  occupy  a  finite 
portion  of  the  tank  at  any  instant  of  time  as  it  travels  along.  Mathematically, 
these  features  are  expressed  by  the  requirement  that 

lim  y  [T  (n(x,  t))2dt  =  0 

2T->  oo  J  J  _t 

and 

1  2 

lim  w  I  (n(x,  t))  dx  =  0 

2X  ->  °°  I  >  -X 

In  contrast,  the  above  limits  both  approach  a  constant  if  n(x,  t) 
represents  a  stationary  random  process.  The  techniques  of  "Power  Spectral 
Analysis"  are  therefore  inapplicable  for  transients. 

Transients  can,  however,  be  studied  in  terms  of  Fourier  integrals,  and 
the  theory  is  well  developed  for  a  linear  model.  Attempted  corrections  for 
nonlinear  effects  were  the  motivation  for  some  of  the  work  described  above. 

According  to  Lamb  (1932),  (Article  236  Chapter  IX),  'The  investigations 
of  Arts  227-234  relate  to  a  special  type  of  waves;  the  profile  is  simple- 
harmonic,  and  the  train  extends  to  infinity  in  both  directions.  But  since 
all  our  equations  are  linear  (so  long  as  we  confine  ourselves  to  a  first 
approximation),  we  can,  with  the  help  of  Fourier's  Theorem,  build  up  by 
superposition  a  solution  which  shall  represent  the  effect  of  arbitrary 
initial  conditions.  Since  the  subsequent  motion  is  in  general  made  up  of 
systems  of  waves,  of  all  possible  lengths,  travelling  in  either  direction, 
each  with  the  velocity  proper  to  its  own  wave-length,  the  form  of  the  free 
surface  will  continually  alter". 

Suppose  that  a  transient  is  observed  at  some  arbitrary  x  =  0  as  a 
function  of  time,  and  that  it  is  an  odd  function.  Then  the  Fourier  spectrum 
is  given  by 


(24) 


(25) 
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b(oj)  =  (^r  )  [  n(t)  sin  cot  dt  (26) 

'  _oo 

The  function  b(to)  is  an  odd  function  and  n(t)  can  be  chosen  such  that 
b(co)  will  have  a  positive  maximum  at  u)  =  and  a  negative  minimum  at  w=  -<^0> 
with  a  value  of  zero  at  to  =  0  as  in  (27) , 


b(co)  =  b^  (to-too)  -  b2(to+  u)Q) 
with  b(0)  =  0. 


(27) 


Then 


n(x,  t) 


-b^Cto-^)  sin  ((to  x/g)  -  cot)dto 


(28) 


can  represent  a  transient  such  that  n(0,  t)  recovers  n(t),  and  such  that  those 
spectral  components  for  to  >  0  are  traveling  in  the  positive  x  direction.  As 
long  as  b^  at  to  =  0  is  small,  the  mathematical  simplification  of  allowing  some 
part  of  the  disturbance  to  go  the  other  way  can  permit  the  explicit  evaluation 
of  a  particular  model. 


It  is  interesting  to  consider  the  Fourier  spectrum  of  n(x,  t)  as 
observed  at  some  other  x,  say  x  =  Xj  as  a  function  of  time.  Equation  (28) 
can  be  rewritten  as  (29) 


n(x1,  t) 


+ 


2 

ru>  Xj  ^ 

b(w-«0)  sin  [— — J 

2 


cos  ut  da) 


cos 


[W 

- I  sin  wt  dw 

l  8  J 


(29) 


where  the  first  term  is  even  and  the  second  term  is  odd  as  a  function  of 
time.  The  Fourier  transform  to  find  the  new  spectrum  at  x^  is,  say, 


A(w  ) 


I 

t)  cos  a)  t  dt 


(30 


and 

B(w  ) 


J  -CO 


n(x^,  t)  sin  oj't  dt 


(31) 


The  dominant  term  for  (30)  is 


1  {'  .  f  .  .  u  X1  f  1  sin(w-o)1  )tl  >* 

=  2?  -b("-“0>  sin  —  It  - — r—  dt 

J  -co  &  V.  (to— to  )  ' 


which  reduces  to 

,  (w')2x1 

A(u')  =  -b.  (to  -oj  )  sin  - 

1  o  g 


and  similarly 

r(to*D  2x  ^ 

B(co')  =  -  ojq)  cos  ^ - - - j 


so  that 


9  2  U 

(A“  +  BZ)  2  =  b. 


and  the  Fourier  integral  spectrum  has  a  certain  invariant  property  as  the 
disturbance  travels  as  a  function  of  x. 

It  is  usually  not  too  difficult  to  pick  an  f)(t)  such  that  (26)  can  be 
integrated,  but  the  integration  of  (28)  is  often  complicated.  As  a  last 
resort,  the  stationary  phase  approximation  can  be  used  (see,  for  example, 
Hui  (1983))  so  that 

\  I  n  ft Tt"  ^  7T  1  _ 


nu,  t)  •>-  \|X  b  [gj  sin 


Two  examples  are  known  for  n(t)  picked  in  such  a  way  that  both  (26)  and 
(28)  can  be  integrated.  One  is  equation  (37)  (See  Neumann  and  Pierson  (1966)) 


2  2 

n(t)  =  -a  e  sin  co  t 

v  J  o 


which  yields 


f  /4u)  2  g2  f  12-) 

n(x,t)  *  exp  — 2" —  (x  -  (gt/2u)o)J  J 


f  w0  x  ‘V  4B4t2x  1  „  -1  4B2x 
•  sin  l  iD  -  —  -  ~ D  g  +  2  tan  — 

where  D  =  1  +  (16  R4  x2/g2) 
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In  a  linear  theory 


■§£■  =  gn  at  z  =  0  (40) 

can  be  found  ($xx+  $Z2=  0) 

(41) 

2 

If  this  is  done,  the  term  x  -  (gt/2u)Q)  becomes 
2  2 

2  gz  B  g  7i  zt  ,  nv2 

z  -  -^-2 - 2 - +  (x  -(gt/2o>  )) 

4B  a)  ? 

o 

the  term  D  becomes 

2  4  2  2 

n  -  7  8B  z  16B  (z  +  x  ) 


« 

z/g))  . 

4 

the  fact  that  the  effects  of  the  shorter  waves 
decrease  more  rapidly  with  depth,  z,  (positive  up  and  not  x  +  iy  as  before) . 

A  second  solution  for 

n  =  -a  sin  u)  t  (42) 

o  ' 

if  -NT  <  t  <  NT  for  integer  N  and  zero  otherwise,  where  T  =  2tt/u)  ,  can 
o  o  o  o 

be  found  in  terms  of  Fresnel  integrals  (See  Kinsman  (1965)).  For  n  small, 
the  evaluation  is  messy,  and  it  is  rather  surprizing  how  well  (36)  seems  to 
fit  the  solution  for  low  waves  that  have  traveled  a  short  distance.  The 
envelope,  b(gt/2x),  has  a  form  that  seems  to  fit  the  transients  given  by 
Yuen  and  Lake  (1982)  (pg.  81)  and  Su  (1982)  (Figs.  2,  3  and  4  for  large 
fetches)  quite  well. 


g 


g 


1  -12 

and  tan  4B  x/g  becomes 


j  tan'1  j^4B2  x/g(l  -  (4B2 


These  changes  account  for 


so  that  the  time  derivative  of  the  potential  function 
by  integrating 
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Linear  Dispersion  and  Coalescence.  The  solutions  to  the  kind  of  problem 
described  immediately  above  are  supposedly  well  known.  If  near  some  x,  h(x,t) 
takes  a  brief  time  to  pass  at  x  =  0,  then  the  waves  disperse,  or  spread 
apart  with  distance  traveled  so  that  visibly  longer  waves  with  longer  apparent 
periods  arrive  at  a  given  point  first,  followed  by  moderately  long  waves  and 
moderate  apparent  periods  and  finally  by  the  shorter  waves  with  shorter  apparent 
periods.  What  is  probably  also  well  known,  but  not  usually  recognized,  is  that 
waves  can  un-disperse,  or  coalesce.  For  example,  if  (38)  is  evaluated  at  some 
moderately  large  negative  value  of  x  as  a  function  of  time,  short  apparent 
period  waves  are  seen  first,  followed  by  moderate  apparent  period  waves  with 
the  long  apparent  period  waves  coming  along  last.  As  the  coalesing  group 
moves  along,  the  waves  all  combine  to  produce  (37)  at  x  =  0. 

Some  Wave  Tank  Experiments.  The  concept  of  generating  a  coalescing  wave 
transient  was  described  by  Cummins  (1962).  Periodic  waves  of  varying  heights 
and  random  seas  were  two  available  ways  to  determine  the  response  of  model  ships 
to  waves.  The  idea  that  a  transient  would  contain  a  wide  range  of  wave  fre¬ 
quencies,  and  yet,  in  a  sense,  be  deterministic,  was  suggested  as  a  way  to  reduce 
the  number  of  test  runs  of  a  particular  model. 

The  mathematical  aspects  of  the  analysis  were  difficult,  high  speed  computers 
were  not  quite  high  speed  enough,  and  the  concept  of  an  FFT  would  not  be 
available  for  at  least  three  more  years  (Cooley  and  Tukey  (1965)).  With  all 
of  the  inherent  difficulties,  Cummins  (1962)  had  the  insight  to  realize  that 
something  new  might  be  learned  by  studying  the  transient  response  of  a  model 
ship  to  a  passing  wave  transient. 

Davis  and  Zarnick  (1964)  persued  this  line  of  investigation  in  1962  and 
1963.  Their  figures  (Figs.  6,  12,  27,  28)  have  considerable  similarity  to 
our  Figures  13  to  16.  The  theories  used  in  both  of  these  studies  were  linear 
theories  whereas  as  the  transient  coalesces  the  waves  in  it  become  more  and 
more  decisively  nonlinear.  Although  a  linear  theory  yields  the  same  Fourier 
integral  spectrum  for  a  transient  everywhere  except  for  phase  shifts,  the 
nonlinear  effects  are  quite  different  and  a  model  encountering  the  transient 
during  different  stages  of  its  coalescence  would  behave  quite  differently. 

At  Hydraulics  Laboratory  of  the  National  Research  Council  of  Canada  and 
the  Hydraulics  Laboratory  of  the  Canada  Centre  for  Inland  Waters,  experiments 
in  which  rcproducable  wave  transients  can  be  generated  and  studied  have  also 


been  done.  These  experiments  can  help  bridge  the  gap  between  nonlinear 
periodic  waves  and  their  properties  and  the  random  nonlinear  wave  found 
in  nature. 

Mansard  and  Funke  (1982  )  needed  to  test  models  of  communication  buoys 
and  various  structures  for  the  effects  of  waves  that  break  over  them,  or 
around  them.  They  developed  a  way  to  generate  transients  at  the  wave  board 
(including  a  model  for  the  board  transfer  function)  such  that  the  waves  would 
travel  down  the  tank  and  coalesce  as  they  advanced.  At  the  test  object,  a 
plunging  breaker  (See  the  three  photographs  in  the  reference)  could  be  formed 
as  well  as  various  forms  of  isolated  short  duration  wave  groups.  They  were 
different  from  the  above  linear  theory  only  in  that,  if  they  did  not  break, 
they  were  very  nonlinear.  Fig.  13  shows  the  input  wave  form  at  a  distance 
of  1.8  m  from  the  wave  generator,  relative  to  an  arbitrary  zero  as  redrawn 
from  the  reference.  The  oscillations  from  zero  to  20  seconds  are  too  rapid 
to  redraw  well  and  so  only  the  envelope  is  shown  as  the  apparent  periods 

decrease  to  the  left.  The  reproducable  output  from  the  generator  would  be 

described  in  engineering  term  as  an  amplitude  modulated  frequency  modulated 
chirped  signal.  The  input  at  1.8m,  after  traveling  21  m  changes  into  the 
second  wave  form.  The  highest  crest  to  trough  wave  in  the  input  wave  form 

is  roughly  0.15  m.  At  22.8  m,  the  height  is  0.28  m  (almost  double). 

Referred  to  by  the  authors  as  an  episodic  wave,  the  mechanism  of  its  forma¬ 
tion  is  the  coalescence  of  a  dispersed  train  of  waves  with  short  waves  in 
front  and  longer  waves  behind  that  could  conceivably  occur  at  random  times 
and  places  in  an  actual  seaway. 

Figure  14  illustrate  attempts  by  Mansard  and  Funke  (  1982  )  to  generate 
various  wave  forms  at  a  measurement  point  22.8  m  from  the  generator.  The 
left  hand  column  of  figures  shows  what  was  desired,  and  the  right  shows  what 
was  measured.  For  the  first  three,  the  mean  to  crest  and  the  mean  to  trough 
desired  signals  had  the  same  value.  The  measured  signal  had  higher  crests 
and  shallower  troughs.  High  waves  will  not  behave  as  linear  waves.  The 
last  desired  transient  was  made  nonlinear,  and  the  output  differs  from  it  in 
peculiar  ways.  The  third  measured  transient  looks  more  like  the  fourth 
desired  transient  than  the  fourth  measured  transient. 

A  study  of  transients  produced  by  a  generator  capable  of  moving  in  the 
horizontal  direction  exactly  as  programmed  by  a  time  varying  signal  is  being 
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FIGURE  14  Desired  and  Measured  Transients 
Wave  Tank.  Redrawn  from  Mansard 
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done  by  M.  Donelan,  W.  H.  Hui  and  W.  Pierson.  Results  are  only  in  a  preliminary 
stage,  but  we  plan  to  study  both  low  transients  and  transients  with  increasing 
amplitude.  It  is  possible  to  check  some  of  the  results  of  the  various  theories 
described  previously.  There  is  a  transfer  function  between  the  motion  of  the 
generator  and  the  resulting  wave  motion  so  that  the  vertical  scales  are  not 
comparable,  nor  are  the  scales  for  the  recorded  data.  The  gain  was  changed 
from  one  run  to  another  to  optimize  the  digital  records.  The  general  shapes 
for  the  inputs  and  the  outputs  are  comparable.  The  time  origins  are  not 
comparable  from  one  pair  to  another. 

The  top  part  of  Figure  15  shows  a  sine  wave  with  a  box  car  envelope  that 
is  turned  on  instantaneously  at  the  start  and  off  instantaneously  at  the  end. 
There  are  eight  oscillations  under  the  box  car.  It  is  difficult  to  do  this 
with  a  generator  and  the  response  of  the  water  in  front  of  the  generator  must 
be  complicated.  The  waves  10  m  from  the  generator  probably  do  not  have  a 
spectrum  that  relates  to  the  input  function  very  well.  The  amplitude  of 
the  input  was  low.  However,  for  the  waves  recorded  at  60  m,  instead  of 
eight  there  are  quite  a  few  more.  Long  apparent  period  waves  arrive  first, 
and  so  on.  The  envelope  has  a  main  maximum,  preceded  and  followed  by  two 
smaller  groups,  which  in  turn  may  be  preceded  and  followed  by  two  still 
smaller  groups. 

The  waves  that  passed  the  wave  recorded  at  60  n  were  recorded  on  an 
FM  tape  recorder,  and  the  result  was  played  back  into  the  wave  generator 
with  the  time  reversed.  The  waves  recorded  this  second  time  around  have  co¬ 
alesced  and  almost,  but  not  quite,  reformed  the  first  input  signal,  except 
that  it  is  upside  down.  The  sign  of  the  time  reversed  signal  should  have 
been  changed.  The  first  trough  is  a  little  too  small  but  the  next  seven 
troughs  are  all  nearly  equal.  The  first  crest  is  an  extra  one,  but  it  is 
followed  by  eight  crests  of  nearly  equal  height  and  these  are  trailed  by  a 
small  trough,  a  smaller  crest,  another  small  trough  and  some  wiggles. 

Although  the  second  record  is  not  an  exact  duplicate  of  the  original 
input  to  the  generator,  it  comes  close  with  differences  probably  explain¬ 
able  by  the  form  of  the  input  function  and  the  generator  transfer  function. 

Figure  16  shows  an  input  signal  based  on  equation  (37) .  The  chart 
speed  was  doubled  to  get  a  good  look  at  the  function  so  that  it  has  to  be 
bunched  up  to  half  its  present  length  when  comparing  it  to  the  other  curves. 


There  are  five  crests  and  five  troughs.  The  generator  was  driven  with  a 
low  amplitude  signal.  By  the  time  the  waves  had  traveled  60  o,  they  had 
dispersed  with  one  smoothly  varying  envelope,  with,  of  course  ,  long 
apparent  periods  at  front  and  the  short  ones  at  the  rear.  For  this  example, 
the  signal  at  60  m  was  played  back  into  the  generator  with  time  reversed,  but 
for  one  run  slightly  amplified  and  for  the  second  run  substantially  amplified. 

For  the  slightly  amplified  input,  which  was  still  quite  low  and  nominally 
nearly  linear,  the  highest  crest  was  3.9  and  the  lowest  trough  3.1  chart 
scale  units.  The  coalesced  record  had  become  nonlinear.  The  chart  scale 
units  suggest  a  value  of  ak  2i  0.2  had  the  wave  been  one  of  constant  wave- 
number.  For  the  moderately  nonlinear  record,  there  were  a  few  extra  low 
oscillations,  but  the  original  input  was  nearly  recovered. 

With  an  even  higher,  but  still  time  reversed  signal,  the  record  at  60  m 

f\j 

was  highly  nonlinear  (ak  =  0.5+)  .  The  waves  did  not  break,  but  the  recorded 
waves  were  quite  distorted  compared  to  the  input.  Further  amplification  of 
the  input  resulted  in  one  wave  in  the  group  breaking  at  an  ex  post  facto 
predictable  location  upon  repetition  in  the  wave  tank. 

In  a  random  seaway,  every  once  in  a  while  a  train  of  waves  can  form  that 
are  traveling  in  just  the  right  direction  with  increasing  apparent  wave  length 
and  period  in  a  direction  opposite  to  that  of  the  moving  waves.  It  is  then 
possible  for  this  group  of  waves  to  coalesce  into  one  exceptionally  high  wave, 
or  more,  that  seems  to  form  out  of  nowhere.  Fortunately,  they  are  rare  and 
can  break  and  disperse  almost  as  quickly  as  they  form.  They  might  be 
described  as  "episodic"  but  they  are  surely  not  "freaks"  because  they  are 
explainable,  to  a  large  extent,  by  known  properties  of  gravity  waves  and 
they  can  be  formed  at  will  experimentally  in  the  laboratory. 

The  SN AMI!  review  (Seakeeping  1953-1973)  cites,  the  papers  by  Cummins  (1962) 
and  Davis  and  Zamick  (1964)  in  reviews  by  Ogilive  and  Beck  (1974)  and  Ochi  (1974). 
However,  Dalzell  (1974)  reviewed  the  area  of  model  testing  in  random  seaways 
and  notes  both  his  (and  others)  surprize  that  the  linear  model  seemed  to  work 
quite  well  even  in  extreme  seaways.  The  details  of  an  extreme  event  (that  does 
not  cause  capsizing  as  in  Paulling  and  Wood  (1974))  may  be  lost  in  the  computational 
procedures  used  to  analyse  a  random  process. 

Ever  higher  periodic  waves  arc  completely  unrealistic.  Perhaps  something  can 
be  learned  about  the  nature  of  the  catastrophic  events  reported  by  men  of  the  sea 
by  c<  Lning  the  right  kinds  of  transients  with  a  random  seaway  and  making  sure 
that  the  model  will  be  in  the  right  place  at  the  right  time  (or  if  one  prefers  the 
wrong  place  at  the  wrong  time  for  survivability) . 


A  Nonlinear  Transient  Wave  Packet.  The  solution  of  (37)  given  by  (38) 
might  somehow  be  related  to  equations  (17)  to  (23)  with  a  little  care  in 
changing  cosines  to  sines.  However,  the  argument  of  the  sine  in  (38)  when 

used  to  calculate  a  local  wavenumber  and  frequency  from  (19)  and  (20),  say 

a,  ^ 

k  and  ui,  will  not  give  a  local  progressive  wave  of  the  form 

ri(t)  =  a  sin  (kx  -  oat)  =  a  sin  0  (43) 

Also,  the  0  in  (17)  must  be  a  function  of  the  local  amplitude  of  the  wave 

'X.  % 

train  which  involves  the  exponential  term  in  (38),  and  0  =  -w  plus  0x  =  k 
must  also  satisfy  (22)  .  Whatever  phase  function  results  must  also  satisfy 
(38)  when  the  amplitude  of  the  disturbance  goes  to  very  small  values. 
Finally,  the  expression  for  the  potential  function  in  (18)  does  not  provide 
for  the  added  complexity  that  k  also  varies  with  depth  as  in  (41) . 


Nevertheless,  it  is  interesting  to  try  to  see  how  close  one  can  come 

to  an  extreme  wave  by  studying  a  coalescing  and  then  dispersing  transient. 

To  do  this, we  consider  the  transient  to  be  a  nonlinear  improvement  on  (38) 

and  study  its  behavior  as  a  function  of  time  at  various  fixed  values  of  x, 

designated  by  x  =  xT .  The  envelope  in  (38)  is  a  maximum  as  a  function  of 

time  when  t  =  t  =  2oj  x,/g  so  that  t  =  t  +  t  can  be  a  transformation  of 
o  o  1  o 

the  time  to  a  value  when  the  peak  of  the  transient  passes,  x  =  Xj .  Thus, 
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and 
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t  -  t  =  t 
o 


(45) 


The  argument  of  (38)  reduces  to  equation  (46) 


0  =  - 


2 

Uo  *1 


\o  +  D(x. ) g 


1  0  i  1  -\f  2  1 

^  t  jt  ^  tan  [4B  Xj/gj 


so  that  the  local  frequency  is 

4B  xi  .■ 

"  *  "o  *  c 


The  value  for  a(x,  t)  in  (17)  becomes 


A  =  a(x,t)  = 


OHxijy 


exp  (-(4u£  B2(t*)2)/g2 (D(x* )) 


(46) 


(47) 


(48) 


and  all  that  is  needed  is  k  (x,  t) . 


If  it  is  assumed  that  (47)  and  (48)  are  correct,  then  (22)  can  be 

*\j 

solved  numerically  for  k(x,t),  and  (17)  can  be  evaluated  as  a  function  of 
1 

t  for  a  fixed  x^ .  The  phase  function  thus  depends  on  the  local  amplitude 
of  the  envelope  as  the  transient  passes  by. 

2  1 

For  a)  =  2tt/1.5,  B  =  ^  and  a  convenient  value  for  a  in  the  linear 
o  5  , 

model,  tQ  can  be  varied  in  0.25  second  steps  to  find  x^,  and  t  can  then 
be  varied  in  0.25  second  steps  to  generate  time  histories  at  each  x^ . 

When  referred  back  to  true  time  t,  the  values  for  the  same  time  at  successive 
values  of  x^  generate  the  wave  profile  as  a  function  of  x. 

Figures  17a  and  17b  show  the  wave  profiles  as  a  function  of  x  as  time 
varies  from  -4.25  seconds  to  4.00  seconds.  The  transient  is  moving  from 
left  to  right.  Much  farther  to  the  left,  the  transient  would  be  a  sequence 
of  many  low  waves  with  long  waves  to  the  rear  and  short  waves  in  the  front 
as  in  previous  figures.  At  -4.25  seconds  there  is  the  crest  of  the  long 
wave  at  -8  meters,  a  trough  at  about  -5.5  meters  a  crest  at  -4.4  meters,  a 
trough  at  -3  meters  and  a  small  crest  near  -2  meters.  The  transient  has  not 
reached  x  =  0. 

As  time  passes  the  rear  wave  grows  in  height  and  shortens  in  length 
until  -2.00  seconds.  It  is  followed  by  a  trough  that  gets  deeper  and  deeper 
until  -0.75  seconds,  and  as  the  trough  forms  a  crest  begins  to  build  up 
behind  it. 

The  deepest  trough  forms  at  -0.75  seconds  with  a  value  of  -0.25  m,  and 
if  scaled  up  by  Froude's  law  by  10  so  that  was  2tt/15,  the  result  would  be 
a  fairly  realistic  "hole"  in  the  sea  as  described  in  the  introduction. 

The  wave  form  is  nearly  an  odd  function  of  x  at  t  =  0  in  Figure  17b  . 
except  that  the  crest  is  higher  and  the  trough  shallower.  (Perhaps  -0.25 
is  even  closer  to  an  odd  function) . 

The  crest  that  started  to  build  at  about  -1.0  seconds  reaches  its  peak 
at  0.75  seconds  with  a  value  of  0.42  m.  It  rapidly  progresses  through  the 
envelope  decreasing  in  height  until  it  is  nearly  gone  and  quite  long  at 
4.00  seconds. 


6 


1 1  ’ 


[G.  17b  Successive  Wave  Profiles  as  a  Function  of  x  at 
Different  Times  as  a  Transient  Passes  x  =  0. 
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Near  1.25  seconds  another  crest  starts  to  form,  increases  in  height 
until  3.25  seconds  and  then  starts  to  shrink.  It  is  never  as  high  as  the 
crest  at  0.75  seconds.  The  end  of  the  sequence  at  4.00  seconds  look  very 
much  like  the  mirror  image  of  the  -4.00  profile  near  the  start  of  the 
sequence.  Long  waves  are  now  at  the  front  and  short  waves  are  at  the  rear. 
The  wave  form  at  0.75  seconds  looks  like  the  time  histories  that  Buckley 
(1983)  gives  for  Hurricane  Camille. 

The  analysis  used  here  is  probably  not  quite  right,  although  the 
envelopes  of  transients  are  frequently  propagated  in  recent  papers  at  the 
group  velocity  of  the  carrier  frequency  as  in  (38) .  Had  equation  (21)  been 
used,  the  envelope  would  become  distorted  and  a  nonlinear  function  of 
itself.  The  group  velocity  would  increase  with  increasing  wave  amplitude, 
and  it  would  be  difficult  to  keep  track  of  time  and  space  reference  points. 
The  results  obtained  are  considered  to  be  close,  but  not  close  enough,  to 


THE  PROBABILITY  DENSITY  FUNCTION  FOR  THE  CREST  TO  TROUGH 
HEIGHTS  OF  OCEAN  WAVES 

Review.  The  significant  wave  height  was  originally  defined  in  terms  of 
visual  observations.  When  waves  were  first  recorded  instrumental ly  as  a  time 
history,  a  relationship  between  the  visual  observations  and  the  waves  in  a 
time  history  was  needed.  The  definition  that  seemed  to  fit  best  was  that  the 
significant  wave  height  was  the  average  of  the  one  third  highest  waves  in  a 
time  history  as  recorded.  The  significant  period  was  the  average  of  the 
periods  of  these  one  third  highest  waves.  The  beginning  attempts  at  wave 
forecasting  tried  to  predict  these  two  numbers. 

Upon  the  development  of  spectral  analysis  procedures,  the  work  of  Rice 
(1944),  which  is  valid  for  any  linear,  hence  Gaussian,  process  showed  that 
the  envelope  of  a  Gaussian  process  has  a  Rayleigh  pdf.  For  a  narrow  band 
process,  a  quick  extention  to  the  pdf  of  crest  to  trough  wave  heights  was 
made  as  in  (1)  . 

The  moments  of  an  ocean  wave  spectrum  are  defined,  depending  on  the 
author,  as 

E.  =  m.  =  y.  =  or1  S  (gj)  du>  (49a) 

J  J  1  Jo 

for  j  =0,  1,  2,  3  and  4,  and  such  that  for  j  >  4,  the  concept  of  a  moment  for 
waves  starts  to  break  down  because  one  is  looking  at  high  frequencies  beyond 
the  range  of  most  wave  recorders  and  various  kinds  of  noise  can  contaminate 
the  calculation  from  spectra  estimated  from  actual  data. 

Given  f(h)  in  (1),  the  value  h^  can  be  found  such  that 

f (h)dh  =  j  (49b) 

1  t 

Then  h  j  =  3  j  hf(h)dh  =  4 .0046 (E  )  2  (50) 

6  Jhj 

Since  Eq  is  not  very  well  known  from  a  typical  wave  record,  the  equation 
can  be  simplified  to  (3). 

Difficulties  begin  to  arise  when  the  same  wave  time  history  is  processed 
in  two  diffirent  ways;  one  by  ranking  the  waves  from  the  lowest  to  the  highest 
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and  averaging  the  heights  of  the  one  third  highest  waves  and  the  other  by 
estimating  the  spectrum,  integrating  it  to  get  E  ,  and  computing  h  y  from  (3). 

If  the  first  way  yields  a  value  that  is  nearly  aways  less  than  the  second 
as  in  Figs.  1,  2,  and  3,  there  is  reason  to  question  the  validity  of  the 
Rayleigh  pdf.  Even  though  the  confidence  limits  on  the  estimate  of  the 
significant  wave  height  from  the  estimated  spectrum  is  usually  broad  enough 
to  include  the  significant  wave  height  computed  from  the  actual  time  history, 
the  fact  that  the  two  are  different  with  one  nearly  always  lower  than  the 
other  for  the  same  time  history  is  the  source  of  the  problem. 

The  difficulty  is  not  only  with  reference  to  the  significant  wave  height 
but  also  can  affect  the  value  of  the  average  of,  say,  the  one  tenth  highest 
waves  and  the  expected  value  of  the  highest  wave  in  a  sample  of  size,  N. 

Only  the  last  of  the  three  ways  to  study  extreme  wave  heights  described  by 
Ochi  (1982)  would  remain  valid,  and  the  data  base  for  this  method  is  limited 
and  not  of  very  good  quality. 

Past  Results.  The  crest  to  trough  wave  height  pdf  has  been  studied  by 
Longuet -Higgins  (1952,  1980)  Spring  (1978)  Forrestall  (1978)  Tayfun  (1981a, b,  1983) 
and  Pierson  and  Salfi  (1982)  (unpublished  report  for  DTNSRDC) ,  among  others. 

An  ocean  wave  time  history  as  a  Gaussian  (and  hence)  linear  process  can  be 
represented  by 

x(t)  =  n(t)  =  R(t)  cos  6 (t)  (51) 


Instead  of  x(t)  in  a  Taylor  series  about  t  =  0,  one 
Salfi  (1982)  follow  Ehrenfeld,  et  al.  (1958)  and  express 
Taylor  series  about  a  time  picked  at  random  as  in  (52) . 

n(t)  =  |Rq  +  R  t  +  Rt/2jsin  ^0q  +  0t  +  0t  /2 

It  was  possible  to  find  the  multivariate  pdf  of  the 
in  (52)  as,  say, 

f(Ro,  R,  R,  0Q,  0,  0) 
which  upon  integration  over  the  domain  of  definition  yields  one. 

The  treatment  of  the  0  term  in  the  pdf  above  Was  intractable  and  (52)  was 
simplified  to  (53)  by  integrating  out  0. 


can  as  in  Pierson  and 
R (t)  and  0(t)  in  a 

(52) 

six  random  variables 
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The  above  pdf  would  have  to  be  evaluated  over  a  five  dimensional  space 
to  generate  portions  of  equation  (52) . 


n(t)  =  (R  +  Rt  +  Rt  /2)  sin  (0Q  +  9t) 


If  0  =  0  (or  by  a  similar  analysis  tt)*,  a  crest  will  occur  at  8t  =  tt/2 


so  that 


(Tfl  •  •  ••  2  »2 

—  =  R  +  Rtt/20  +  R  tT/80 

20J  ° 

A  trough  will  occur  at  8t  =  -  tt/2  so  that 
n|-  =*  -  Jrq  -  R  tt/ 2 0  +  R  TT2/802 


The  height  of  the  wave  will  be 


••  9  *2 

h  =  2  R  +  R  tT/40 
o 


Equation  (56)  shows  that  h  is  not  dependent  on  R  to  this  level  of  approxi- 
mation.  If  the  above  pdf  is  integrated  over  R  first,  and  then  0,  the  result  is 
equation  (57) . 


f(R,  R,  0)  =  - — J~At-  exp  -H  (|  Rq  +  |  (R  -  (02  -  -(-D6^  -)R0)2|  (57) 

o  2tt  tx1  E  2  E  o  A  E  o  j 

o  '• 


The  transformation  of  variables 

fp^ 

Ro  *  W  4 


"  \h\'2  f *2  D8  +  f|  ( E] ^ . 

R'-  E  9  -  —  M  5 


0  =  8 


whose  inverse  is 


With  crest  and  trough  interchanged 


yields  equation  (64)  from  equation  (57)  and  equation  (65)  from  equation  (56) . 
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where  0<6<°°,  -  <»<y<00  and  0  <  0  <  00 . 

Equation  (64)  has  been  factored  into  the  form 


(64) 


(65) 


£(«,  Y,  0)  =  f1  (6)  •  f2(y)  •  f3(0)  (66) 

and  therefore  6,  y  and  0  are  independent  (a  surprising  result  indeed) .  The 
wave  height  is  a  complicated  function  of  6,  y,  and  0.  In  principle,  it 
ought  to  be  possible  to  find  an  appropriate  transformation  of  variables  and 
appropriate  ranges  of  integration  to  obtain  the  pdf  of  f(h),  but  so  far,  little 
success  has  resulted  from  this  line  of  investigation. 

Instead,  for  a  given  set  of  spectral  moments,  Eq,  Ej,  E2>  E^,  E4>  (or 
mg,  m^,  m2,  m^,  m^,  in  other  notation)  the  fact  that  6,  y  and  0  are  independent 
can  be  used  and  a  random  sample  of  values  of  h  can  be  generated  and  compared 
with  experimental  data. 

The  procedure  is  to  select  at  random  triplets  of  numbers,  one  from  a  unit 
normal  distribution  for  y  and  the  other  two  from  the  rectangular  distribution 
over  zero  to  one.  The  cumulative  density  functions  for  5  is  given  by 


f  r 6 


P(6)  =  4= 

v24 


to 

.  o 


-x2/2  ,  .  -62/2 

e  '  dx  -  6  e  ' 


(67) 


where  0<  P(5)  <  1  is  a  number  from  the  0  to  1  rectangular  pdf. 


The  cumulative  density  function  for  0  is  given  by 


i, 
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P(0)  = 


<Eo  E2>  ’  +  E1 


u  E  -  E  0 

e1 - - - L- 


2  •  »? 
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(E,  -  2E.9  +  E  0  ) 

2  I  o 


(68) 


again  where  P(9)  is  a  number  from  the  0  to  1  rectangular  pdf. 


In  finding  the  cdf,  it  was  necessary  to  determine  a  constant  K  so  as 
to  renormalize  the  pdf  of  9.  It  is  given  by  (69) . 


(E2  +  Ex 


2  E 


(69) 


To  find  6  and  9,  a  simple  table  of  P(6)  versus  6  and  P(9)  versus  9  can  be 
constructed,  and  given  P ( 5)  and  P ( 0)  the  value  of  6  and  9  can  be  found. 


The  randomly  chosen  values  of  y,  6  and  9  then  yield  a  random  value  for 
the  wave  height  h.  A  sample  of,  say,  500  values  of  h  ought  to  yield  a  fairly 
reliable  cumulative  density  for  h. 


Equations  (56)  and  (57)  provide  some  insight  of  what  might  be  expected. 
If  R  were  very  small  (56)  would  reduce  to  the  Rayleigh  pdf.  Small  values  of 
9,  which  corresponds  to  longer  than  average  periods,  and  higher  waves,  will, 
produce  frequent  negative  values  of  R  and  thus  higher  waves  may  be  less 
frequent  than  the  Rayleigh  predicts.  Conversely  large  values  of  9,  which 
correspond  to  short  periods  and  low  waves  will  produce  positive  values  of  R 
and  hence  low  waves  will  be  less  frequent  than  the  Rayleigh  predicts. 


The  symbols  ip>  0,  E,  F  and  A  above  are  defined  as  follows: 

\f>  =  E_  -  2E,  0  +  E  92 
2  1  o 

D  =  E  E  -  E.  E_ 
o  3  12 

E  -  £o  E2  -  Ej2 
F  *  E2  -  E1  E3 


-E2  -E3  E4 


(70) 

(71) 

(72) 

(73) 

(74) 


The  difficulties  with  theories  that  depend  on  the  higher  moments  of  wave 
spectra  lie  in  the  increased  sampling  variability  of  the  moments  because  the 
moment  spectra  are  more  nearly  white  noise  and  in  the  contamination  of  the 
high  frequencies  in  the  spectral  estimates  by  digitization  and  round  off  errors 


in  the  original  wave  data. 


The  definition  of  the  spectral  width  parameter  used  by  Spring  (1978)  is  (75) 


E  Ea 

o  4 


E  E 
o 


in  our  notation. 


(75) 


To  avoid  the  use  of  spectral  moments.  Spring  (1978)  used 


(76) 


where  e  is  the  spectral  width  parameter  given  in  Tables  1,  2  and  3.  T£  is  the 
average  time  interval  between  crests  and  T is  the  average  time  interval 
between  zero  up  crosses. 


For  data  tabulated  in  the  form  of  pairs  of  wave  heights  and  wave  "periods", 
T  ,  can  be  found  by  averaging  the  periods  and  tabulated  values  of  e  as  in  Table 
1  then  yield  T  .  The  reciprocals  of  these  are  and  £4  as  in  the  definitions 
of  (77)  and  (78). 


S(f)df 


(77) 


E 
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K 


f  f4  S(f )df 


(78) 


and  thus  f^  =  (1  -  e^) f ^  (79) 

To  Monte  Carlo  equations  (65)  and  (66)  also  requires  f^  and  f^  defined 
by 

Eq  fj  =  £  f  S(f)df  (80) 

-3  r  3 

Eq  f3  =  I  f'5S(f)df  (81) 

It  can  be  shown  that  f ^  <  fj  <  f4»  and  that  (70)  to  (74)  are  always 

positive.  Reasonable  guesses  for  f^  and  f^,  plus  a  nondimens ionalizat ion  of 
h  were  then  used  to  generate  sample  cdf's  by  a  Monte  Carlo  simulation. 


Departures  from  the  Rayleigh  were  large  and  not  very  well  related  to  the  actual 
samples  that  were  used  from  parts  of  the  total  record. 


If  there  is  any  particular  virtue  to  this  theory,  it  will  need  to  be  tested 
against  data  samples  of  sufficient  quality  to  make  the  moments  reliable  and  that 
are  long  enough  in  duration  and  stationary  (statistically)  so  that  a  reliable 
comparison  can  be  made.  The  data  from  Camille  lacks  stationarity  and  have 
other  strange  properties,  as  will  be  shown  later. 


An  Alternate  Approach  to  the  Same  Problem.  Tayfun  (1981a)  solved  a  problem 
for  the  distribution  of  crest  to  trough  wave  heights  in  a  completely  different 
way  for  a  linear  Gaussian  model.  The  pdf  requires  only  the  first  and  second 
moments  of  the  spectrum,  but  the  spectrum  must  be  narrow  band.  In  his  notation. 


frLJ 


S(u>)  dco 


(82) 


and  the  required  spectral  parameter  is 


v2  = 


(83) 


U)  (JJ, 

o  1 


Where  u>  =  u, /y 
o  1  o 


(84) 


so  that  v  can  be  rewritten  as 


2  ^2  %  -  U1 


v  = 


(85) 


It  is  also  required  that  v  «  1. 


The  amplitude  of  the  waves  is  also  scaled  somewhat  differently  in  this 

h 


work.  If,  for  example,  q(t)  is  scaled  by  £  =  n(t)/(2yo)  ,  the  Rayleigh  pdf 


takes  the  normalized  form 


f(x)  =  2x  e 


-x 


for  x  >  0 


(86) 


Two  points  on  the  envelope  of  the  wave  time  history,  separated  by  the 
time  interval  x,  where  the  envelopes  function  is  found  by  finding  the  time 

A 

history  of  the  quadrature  function  of  n  say  q  and  forming  (87), 


x(t)  =  (n  +  rr) 


can  be  shown  (probably  going  back  to  Rice  (1944)),  as  in  Middleton  (1960), 
to  have  the  joint  pdf  given  by  (88)  with  x  as  a  parameter  where  x^  and  x^  are 
points  on  the  normalized  envelope  x  units  of  time  apart. 


f(x.,  XT) 
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,  2  o  I  .  2 

1  -  r  v  1-  r 
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1  r  +  x2  1 

Jexp  l‘TT7~J 


for  Xj,  x2  >.  0  (88) 

where  IQ(*)  is  the  zero  order  modified  Bessel  function  of  the  first  kind. 

In  (88),  r  is  analogous  to  a  correlation  coefficient.  It  is  defined  by 


r(x)  =  (p*  +  pp  2 


where 


p.  (t)  =  y~  S(u>)  (cos(oj  -  uj  )x)dw 
loo  0 


P2(t)  =  yQ  o  S(co)(sin((D  -  0)o)x)dw 


According  to  Tayfun  (1981a) t with  small  changes  in  his  wording,  and  some 
deletions; - 

2 

The  narrow-band  condition  v  <<  1  suggests  that  the  underlying  spectrum, 
S(co),  is  centered  sharply  around  w=  u)q,  and  behaves  as  a  pseudo-delta-function 
Correspondingly,  each  realization  of  n  has  the  distinctive  form  of  slowly 
modulated  oscillations  with  an  apparent  mean  frequency,  u>  . 


When  the  spectrum  is  wide-band  so  that  the  condition  v4<<  1  is  not 

satisfied  strictly,  the  rate  of  change  of  the  envelope  could  be  rather  large* 

In  this  case,  realizations  of  n  can  no  longer  be  viewed  as  amplitude-modulated 

oscillations  with  an  apparent  mean  frequency,  u)q.  Consequently,  the  physical 

interpretation  of  the  envelope  as  an  amplitude  is  somewhat  lost.  Nonetheless, 

the  formalism  provided  remains  valid,  and  a  local  wave  height  can  still  be 

defined  as  H  =  2A.  Noting  that  H  =  2A  ,  it  then  follows  that  the  scaled 

rras  rros 

height  £  =  H/H  has  the  invariant  probability  density  depicted  by  the 
Rayleigh  form  of  equation  (9),  regardless  of  whether  the  condition  v^«  1  is 
satisfied  or  not.  - 


If  corresponded  to  a  normalized  crest  at  time,  t,  and  a  normalized 
trough  followed  at  t  +  T/2,  a  half  cycle  later  then  +  Xj  would  be  the 
normalized  crest  to  trough  wave  heights. 

The  general  case  for  which  t  /  T/2  is  discussed  as  related  to  the  wave 
spectrum  through  the  previous  four  equations,  and  involves  an  integration  over 
the  pdf  of  the  distribution  of  wave  periods  (halved) .  This  is  not  attempted 
and  further  assumptions  are  needed.  These  are  obtained  by  expanding  the 
trigonometric  terms  in  (90)  and  (91)  as  Taylor  series  which  gives  a  simplified 
expression  for  r(t)  in  the  form  of  equation  (92). 

r(x)  =  p(x)  =  1  -  (ojo  tv)2/2  (92) 

The  approximation  essentially  reduces  to  the  assumption  that  each  wave  has 
the  same  half  period  given  by  t  =  tt/cj^,  and  then  those  values  of  x^  and  x^  that 
add  to  2x  are  found  by  a  convolution  by  defining 

x^  =  2x  -  u 

x2  =  u 

and  evaluating  the  integral  in  (95) 

f (x)  =  2  \2x  f(2x  -  u,  u;  ~  )du  (95) 

>  o 

This  reduces  to  the  computational  problem  of  numerically  evaluating  (95), 
given  (92)  with  t  =  tt/u) i  since  the  pdf  has  been  normalized  to  degenerate  to  the 
Rayleigh  form  for  v  =  0.  Two  examples  were  computed,  one  for  v  =  0.1  and  the 
other  for  v  =  0.2.  Fig.  17  shows  the  result  for  v  =  0.2  compared  to  the  Rayleigh 
pdf.  The  points  for  v  =  0.1  are  about  0.01  higher  than  the  Rayleigh  pdf  at  the 
peak,  and  v  =  0.2  pushes  the  approximations  that  were  made  about  as  far  as 
possible. 

These  results  verify  the  guesses  that  were  made  in  the  study  by  Pierson  and 
Salfi  (1982)  and  obtain  them  in  a  much  simpler  way,  but  the  results  would  not 
apply  too  well  for  hurricane  waves.  The  results  also  imply  some  problems  in 
verification  to  be  pointed  out  later  since  successive  waves  would,  of  necessity, 
have  to  be  highly  correlated. 


(93) 

(94) 
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FIGURE  17  Generalization  of  the  Crest  to  Trough  Wave  Height 
Probability  Density  Function  for  a  Narrow  Band 
Spectrum  (Redrafted  from  Tayfun  (1981a),  with  the 
Points  for  v  =  0.1  Onitted) . 


Effects  of  Breaking  Waves.  If  waves  were  Gaussian,  and  if  it  would  still 
be  possible  to  consider  the  effects  of  breaking  within  the  constraints  of  a 
linear  model,  or  perhaps  a  second  order  model  because  the  second  harmonic 
could  cancel  for  crest  to  trough  heights,  then  one  would  expect  some  of  the 
higher  waves  in  a  high  enough  seaway  to  be  limited  in  height  by  breaking. 

The  higher  waves  predicted  by  the  Rayleigh  pdf  would  each  be  some  fraction 
lower  had  they  broken  prior  to  passing  some  point  where  they  are  measured  as 
a  function  of  time. 

Counter  effects,  as  the  waves  approach  the  measurement  point,  are  the 
steepening  just  before  breaking  with  the  formation  of  a  sharp  angular  crest 
with  a  corner  flow  as  described  previously.  Harmonic  analysis  is  probably 
the  wrong  way  to  attempt  to  describe  the  waves  since  a  large  number  of  bound 
harmonics  would  be  needed.  The  crest  to  trough  height  of  a  wave  about  to 
break  would  be  greater  by  some  unknown  fraction  than  predicted  by  a  linear 
or  second  order  model. 

Tayfun  ((1981b) ,  (1983))  has  treated  the  limitations  of  wave  height  by 
breaking  in  a  linear  Gaussian  model  (with  reasons  why  it  probably  still  applies 
at  second  order) .  The  mathematics  of  the  derivation  is  contained  in  the  above 
reference . 

With  6  defined  as 


B  =  k  (2  u)h  (96) 

e  o 

to  represent  a  measure  of  wave  steepness  where 

ke  =  u\/g  (97) 

2  ^ 

for  =  (1  +  v  )  2  (see  equation  (83  )),  (98) 

and  N  defined  as 

N  =  (tt/7  6)  2  (99) 


for  a  normalized  wave  height  given  by 


the  modified  pdf  is  given  by 


f(x)  =  xN 


.  4  -1  f  N*5  1  ..  ,r. 

1  -?cos  [  — J  U[  — 


Jo  ^ 


(101) 


where  U  is  the  Heaviside  unit  step  function,  JQ(0  the  zero  order 

Bessel  function,  and  0  <  x  <  (2N) 2 . 


Figure  18  from  Tayfun  (1981)  shows  both  a  different  way  to  plot  the  data 
and  the  behavior  of  a  mapped  inverse  cdf  as  N  is  varied.  Values  for  N  =  5, 

10  and  00  are  shown.  On  the  original,  N  =  15,  30  and  60  (which  falls  halfway 
between  10  and  °°)  are  also  shown. 


For  this  type  of  normalization,  note  that  the  normalized  cdf  for  N  =  00 
is  given  by 

F (?)  =  1  -  exp  (-(C)2)  (102) 

which  can  be  transformed  to 

C  =  (-£n  (l  -  F(C)))!s  (103) 

so  that  the  log  of  (1  -  F(C))  can  be  graphed  against  C  starting  from  C  =  0 
where  F(C)  =  0  and  0  has  an  antilog  of  one.  The  curve  that  results  is  the 
probability  that  C  exceeds  the  value  plotted  on  the  vertical  axis.  A  sample 

_3 

point  at  10  requires  1000  independent  value  from  the  sample  for  verification 


An  example  in  Tayfun  (1983)  of  the  pdf  plotted  in  the  usual  way  shows 
slight  deficits  compared  to  the  Rayleigh  for  low  values  of  the  scaled 
height,  excesses  past  the  maximum  of  the  pdf  and,  as  expected,  deficits 
past  a  scaled  height  of  about  2.  The  differences  between  this  result  and 
the  result  for  a  narrow  band  spectrum  without  breaking  is  that  the  pdf  for  the 
narrow  band  process  peaks  near  the  maximum  of  the  Rayleigh  whereas  the 
breaking  wave  model  corrects  the  pdf  with  higher  values  past  the  peak.  Both 
pdf's  are  lower  than  the  Rayleigh  for  larger  values  of  the  scaled  height. 

For  both  of  these  models,  the  significant  wave  height  and  the  average  of 
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FIGURE  18  Modified  Wave  Height  Exceedence  Probabilities  from  Tayfun 
(1981b).  (Curves  for  N  =  15,  30,  and  60  Have  Been  Omitted) 


the  one  tenth  highest  waves  would  be  functions  of  three  parameters  from  the 
spectrum,  i.e.  y  ,  y^  and  y^,  which  in  turn  can  be  used  to  compute  all  of 
the  other  parameters  that  arise  in  the  derivation.  The  spectra  would  have 
to  be  quite  well  known,  either  theoretically  or  as  estimated  from  a  high 
quality  time  history.  Given  a  sample  of  crest  to  trough  wave  heights,  and 
the  moments,  whether  or  not  the  presently  available  tests  are  good  enough 
to  distinguish  between  these  models  and  the  Rayleigh  would  be  a  matter  of 
interest . 

Thornton  and  Guza  (1983),  in  a  study  of  the  pdf's  of  waves  as  they 
travel  through  the  breaker  zone  showed  that  the  Rayleigh  distribution  fitted 
to  the  root  mean  square  height  described  the  significant  height  and  the 
height  of  the  one  tenth  highest  with  average  errors  of  -0.2  %  and  -1.8%, 
respectively,  both  before  the  breaker  zone  and  after  reforming  after  breaking 
with  some  energy  loss.  A  model  based  on  a  shoaling  coefficient  and  a  single 
free  parameter  explained  the  data  within  ±  9%. 

Other  models  to  account  for  wave  breaking  in  shoaling  water,  as  a  review 
of  the  results  of  five  different  authors,  were  also  given,  but  none  of  thorn 
would  have  given  a  Rayleigh  pdf  in  the  shallower  water.  They  write,  after 
reviewing  these  various  models,  and  the  theories  that  have  been  developed  to 
modify  the  wave  height  distribution  in  deep  water,  that  "These  studies  seek 
to  explain  deviations  from  a  Rayleigh  distribution,  but  the  relevant  point 
here  is  that  wave  heights  appear  to  be  nearly  Rayleigh  under  a  much  wider 
range  of  conditions  than  the  strict  assumptions  of  a  narrow  band  Gaussian 
(linear)  process  would  imply". 


TESTING  PROBABILITY  MODELS 


Definition  and  Consequences  Thereof.  If  the  random  variables  X^,  X2, 
Xj...Xn  have  a  joint  density 


..x/V 


X- . . *x  ) 
3  n 


that  factors  into  the  form 


f(x^)  f . . . .f (xr)  where  f(*)  is  the  same  pdf  for  each  X^,  then  X^, 
X2.. .X  is  defined  to  be  a  random  sample  of  size  n  from  a  population  with  a 
pdf,  f(*).  See,  for  example.  Mood,  et  al,  (1963).  The  problem  is  that 
successive  crest  to  trough  ocean  wave  heights  may  not  meet  the  requirement 
that  they  form  an  independent  random  sample. 


In  the  spectral  domain,  the  difficulty  is  overcome  and  successive 
spectral  estimates  as  the  frequency  varies  obtained  via  an  FFT  are  independent. 
It  could  then  be  shown  that  1024  points  equally  spaced  from  a  time  history 
would  be  the  rough  equivalent  of  150  values  from  a  random  sample  for  the 
waves  studied  by  Donelan  and  Pierson  (1983) .  The  more  narrow  band  the 
spectrum  the  fewer  the  effective  number  of  independent  points  for  a  given 
time  history. 


For  the  estimated  spectrum  obtained  from  measurement  of  a  wind  generated 
sea,  it  is  possible  to  compute  the  covariance  function  as  in,  say, 

r 

ft  (t)  =  I  S(u>)  cos  cot  do)  (104) 

'  o 

where  ft (0)  =  E  =  m  =  u 
o  o  Ho 

the  function,  R(t),  is  at  first  positive,  then  goes  negative  by  perhaps  20 
to  30%  of  ft (o) ,  and  then  positive  again.  Successive  crest  to  trough  wave 
heights  may  consequently  be  correlated,  and  if  the  X^  represent  wave  height 
the  joint  pdf  of  X^,  X2-..Xn  may  not  be  factorable  into  the  form  required  for 
an  independent  random  sample. 

Most  of  the  wave  theories  described  above  are  either  for  periodic  waves 
(2  degrees  of  freedom  for  a  linear  theory),  or  slowly  varying  functions  of 
x  and  t,  or  for  waves  with  a  narrow  band  spectrum.  Wave  height  samples 
under  these  assumptions  do  not  qualify  as  random  samples.  One  of  the 
problems  to  be  addressed  in  the  following  is  the  effective  numbers  of 
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independent  wave  heights  in  a  sample  of  say,  N,  successive  wave  heights  from 
an  ocean  wave  time  history. 

There  are  two  tests  that  are  routinely  applied  to  determine  whether  or 
not  a  given  random  sample  could  have  been  obtained  from  a  population  with  a 
certain  pdf.  One  is  the  Chi  Square  goodness-of-fit  test  based  on  a  sample 
histogram.  The  other  is  the  Kolmogorov-Smimov  goodness-of-fit  test  based 
on  a  sample  cdf.  The  latter  is  not  applicable  to  this  particular  area 
because  all  parameters  of  the  pdf  need  to  be  known  a  priori. 

The  usual  Chi  Square  test  has  two  hidden  traps  in  it.  One  is  the  not 
very  well  specified  requirement  to  pool  the  tails  of  the  sample  histogram 
before  Chi  Square  is  found.  Some  light  will  be  shed  on  this  rather  vague 
rule.  The  other  is  that  the  test  gives  too  high  a  value  for  Chi  Square  if 
the  sample  is  not  an  independent  random  sample. 


The  Chi  Square  Test.  Given  a  theoretical  pdf  of  the  form,  f(x^;  p^, 
p2,...p  )  that  depends  on  q  parameters,  that  in  general  have  to  be  estimated 
from  the  sample  as  say  p^  p2,  .  ..p^,  and  an  independent  random  sample,  X^, 
X2,...X  ,  as  defined  above,  it  is  possible  to  divide  the  range  of  X  into 
some  k  +  1  mutually  exclusive  sets,  say,  those  for  the  X's  such  that 

-  <=°  <  X.  <  X  , 

1  —  a’ 

X  <  X.  <  X  +  AX 
a  l  —  a 

X  +  AX  <  X.  <  X  +  2AX 
a  l  —  a 

and  so  on  to 

X  +  kAX  <  X.  <  00  . 
a  l 

The  number  of  X.  that  fall  in  each  set  can  be  counted  to  find  N .  as  j 

i  J 

goes  from  1  to  k  +  1 .  The  probability  that  a  given  X^  will  fall  in  one  of 

the  ranges  defined  above  is  the  integral  of  the  theoretical  pdf  over  the 

range  defined  above,  say,  p?  and  the  expected  number,  if  the  hypothesis  that 

the  sample  came  from  the  assumed  pdf  is  made,  is  given  by  npj  where  n  is 

the  total  sample  size. 


The  quality  can  then  be  computed  as  equation  (105). 
k+1  (N  .  -  n  p?) ^ 

Q  ?~-l  4- 

k  j=l  np? 


(105) 
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Under  these  circumstances  has  a  limiting  distribution, as  n  approaches 
infinity,  of  Chi  Square  with  k  -  q  degrees  of  freedom  where  q  is  the  number  of 
parameters  determined  from  the  sample.  Just  how  large  n  ought  to  be  to  use 
the  test,  which  is  a  large  sample  test,  is  open  and  if  the  q  parameters  are 
estimated  efficiently  some  of  the  lost  degrees  of  freedom  can  be  recouped. 
(Mood,  et  al .  (1963)  pg  443). 


Suppose  now  that  the  n  values  of  X^  are  not  independent  and  that  the  sample 
is  somehow  mysteriously  disguised  such  that  the  true  sample  size  is  n,^,  =  an 
where  0  <  a  <  1.  To  pick  an  example,  suppose  the  200  successive  wave  heights 
in  a  sample  from  Camille  are  effectively  the  equivalent  of  only  100  independent 
sample  values. 


A  corrected  value  of  would  be  given  by 


i  ■? 


k+1  (aN.  -  anp?)2 


=  a  Q 


(106) 


j  =  l  anp. 


and  the  Chi  Square  value  to  be  used  to  test  the  hypothesis  that  the  sample  comes 
from  f (X,  •  •  • , Pq)  would  be  halved.  For,  say,  k  +  1  =  15  and  q  =  2,  the 

appropriate  Chi  Square  has  12  degrees  of  freedom  and  if  Q°  were  to  exceed  21, 
the  hypothesis  would  be  rejected  at  the  5%  level.  Actually  the  value  is 
closer  to  10.5,  which  could  easily  happen  more  than  half  the  time.  For  the 
study  of  successive  crest  to  trough  wave  heights  in  a  time  history,  something 
equivalent  to  the  effective  number  of  independent  samples  value,  as  needed  for 
spectral  estimation,  is  needed  before  goodness-of-fit  tests  can  be  made. 


MARKOV  CHAINS 


Introduction.  If  successive  crest  to  trough  wave  heights  were  inde¬ 
pendent,  then  many  of  the  assumptions  in  the  previous  derivations  would 
not  be  very  good.  If  they  were  independent,  the  waves  might  have  very 
strange  properties  compared  to  what  is  believed  to  be  known  about  them. 

One  of  the  simplest  ways  to  model  a  process  that  does  not  have  the  prop¬ 
erties  required  for  an  independent  random  sample  is  to  study  Markov  chains. 
This  process  simply  assumes  that  the  probability  of  the  next  event  in  a 
sequence  depends  only  on  the  present  state  of  the  process. 

Each  of  the  fifty  five  samples  of  200  successive  waves  was  treated  as 
if  it  were  a  ten  state  Markov  process.  The  200  waves  were  ordered  in 
ascending  order  from  the  lowest  to  the  highest.  State  one  for  a  Markov 
process  was  defined  to  be  the  20  lowest  waves;  state  2  the  next  20  and  so 
on  to  state  10  as  the  highest  20. 

With  the  waves  in  their  original  order,  the  first  wave  in  the  sequence 
of  200  would  be  in  one  of  the  ten  states.  The  next  wave  would  also  be  in 
one  of  the  ten  states.  The  transition  probability  can  then  be  estimated 
by  counting  the  number  of  times  one  of  the  waves  in  state  1  is  followed 
by  a  wave  in  states  1  to  10,  counting  the  number  of  times  one  of  the  waves 
in  state  2  is  followed  by  a  wave  in  states  1  to  10  and  so  on  to  state  10 
for  the  initial  state.  To  keep  things  simple,  it  is  necessary  to  look 
ahead  to  wave  201,  the  first  wave  of  the  next  sample. 

As  examples,  the  transition  counts  for  records  6  to  10  are  shown  in 
Tables  4-6  to  4-10.  Each  row  sums  to  twenty.  Since  there  are  ten  possible 
transition  states,  if  wave  heights  were  completely  random,  there  would  be 
on  the  average  two  counts  in  each  element  of  each  row.  The  chance  of  all 
twos  in  every  element  of  the  matrix  is  small  so  that  200  wave  samples  are 
not  large  enough  to  provide  useful  estimates. 

In  these  five  tables  the  counts  scatter  from  zero  to  six  (the  sixes 
are  in  4-6,  state  10  to  state  4  and  in  4-10,  state  8  to  state  9).  The 
individual  values  are  not  much  different  from  a  sample  from  a  binomial 
discret  density  function  with  p  =  0.1  and  n  =  20. 


Each  table  of  counts  also  shows  the  length  of  time  in  minutes  for  the 
waves  in  the  sample,  the  highest  wave  in  the  sample  and  the  "period"  of 
that  wave.  These  values  were  summarized  at  the  start  by  Fig.  4  and 
Table  4. 

With  55  successive  wave  samples  for  a  total  of  11,000  waves  during 
Camille,  the  ordering  procedure  for  200  wave  sets  has  taken  out  most  of 
the  effect  of  trends  in  wave  height.  The  arrays  of  sample  counts  for  sets 
of  five  records  1  to  5,  6  to  10,  11  to  15,  and  so  on  to  51  to  55  were  pooled 
to  form  a  count  matrix  for  1000  waves  yielding  eleven  arrays  for  1,000  waves 
each. 


Table  4-C-2-1  shows  the  count  array  for  the  second  1,000  wave  combined 
sample.  Each  row  totals  to  100,  and  for  complete’/  random  waves  each 
element  of  each  row  should  be  10.  The  difference  between  what  was  actually 
counted  and  10  is  shown  in  the  second  array. 

There  are  no  zeros  in  the  array.  A  wave  in  any  state  (10%  height  range) 
can  be  followed  by  a  wave  in  any  of  the  ten  states.  The  lowest  values  in  the 
array  are  5's  and  the  highest  values  are  18' s.  A  binomial  distribution  with 
p  =  0.10,  n  =  100  would  have  an  expected  value  of  10  and  twice  the  standard 
deviation  would  be  6.  Since  the  distribution  is  skewed,  a  range  from  5  to 
18  is  not  unusual . 

When  converted  to  the  transition  probability  matrix,  the  result  is 
shown  in  Table  4-C-2-2  .  Any  initial  state,  can  change  to  any  other  initial 
state.  A  wave  in  the  lowest  10%  can  be  followed  by  a  wave  from  the  highest 
10%  with  an  estimated  sample  probability  of  0.12.  Each  probability  for  a 
completely  unpredictable  sequence  of  heights  ought  to  be  0.10.  It  is  clear 
that  for  this  1000  wave  set,  the  prediction  of  the  height  of  the  wave  that 
follows  a  particular  wave,  given  its  height  as  a  number  from  one  to  ten  is 
practically  a  random  choice  at  this  degree  of  resolution. 

If  successive  wave  heights  could  be  treated  as  a  Markov  process,  the 
square  of  the  transition  matrix  would  give  the  probabilities  for  the  states 
of  the  second  wave  to  follow  a  wave  in  a  known  state.  The  square  of  the 
matrix  in  4-C-2-2  is  shown  in  4-C-2-3.  The  values  differ  from  0.10  only  in 
the  third  decimal  place  except  for  state  6.  Under  these  assumptions, 


knowing  the  height  of  a  passing  wave  provides  very  little  knowledge  of  how 
high  the  second  wave  to  follow  it  will  be.  The  cube  of  the  matrix  in  the 
next  table  differs  from  0.10  by  at  most  three  parts  in  1000. 

The  actual  two  step  count  transition  probability  matrix,  and  its  square 
are  given  in  Tables  4-C22-1,  4-C22-2  and  4-C22-3.  The  element  of  randomness 
dominates.  The  counts  scatter  from  5  to  18  again,  whereas  Markov  theory 
based  on  the  square  of  the  one  step  matrix  would  predict  all  elements  to  be 
much  closer  to  10.  There  is  little,  or  no,  apparent  relationship  between 
the  one  step  counts  and  the  two  step  counts. 

The  square  of  the  actual  two  step  matrix  gives  an  idea  of  the  chance  of 
predicting  the  fourth  wave  to  follow  a  wave  of  known  state.  Pick  a  number 
at  random  from  one  to  ten. 

Tables  5-51  to  5-56,  5-C-ll-l,  and  so  on,  provide  similar  tables  for 
the  last  five  samples  of  200  waves  and  the  result  of  combining  the  five 
samples  into  one  1000  wave  sample.  The  contrast  between  4-C-2-1  and  5-C-ll-l 
is  striking.  The  values  near  the  main  diagonal  tend  to  be  above  ten  and 
those  in  the  upper  right  and  lower  left  tend  to  be  below  ten.  Although  any 
state  could  go  into  any  other  state,  a  10%  lowest  wave  goes  to  the  10%  highest  only 
twice.  The  counts  for  the  transition  of  any  wave  in  states  one  to  four 
to  states  eight  to  ten  are  all  under  ten.  The  transition  from  state  ten 
to  state  ten  was  two  and  a  half  times  greater  than  the  chance  value  with  a 
sample  probability  of  0.25. 

The  square  of  the  transition  matrix  indicates  some  residual,  but  weak, 
capability  to  predict  the  second  wave  to  follow  a  wave  in  a  known  state. 

Some  of  the  values  differ  from  0.10  in  the  second  decimal  place,  and  the 
signs  of  the  differences  are  negative  in  the  upper  right  and  the  lower  left 
comers  and  positive  in  the  upper  left  and  lower  right  corners. 

The  actual  two  step  counts,  however,  do  not  verify  the  square  of  the 
one  step  count  matrix.  Counts  above  ten  and  below  ten  scatter  at  random 
throughout  the  array.  The  square  of  the  two  step  matrix  to  predict  the 
fourth  wave  differs  from  0.1  only  in  the  third  decimal  place. 

For  completeness,  the  three  step  count  array  and  the  corresponding 
matrices  are  given  in  Tables  5-C-113-1,  5-C113-2  and  5-C-113-3.  The  counts 


are  essentially  random  numbers  with  no  discemable  pattern.  The  square  of 
the  matrix,  which  might  be  the  prediction  of  the  sixth  wave  to  follow  a 
wave  of  known  height,  can  be  judged  by  inspection  to  be  of  no  use. 

Tables  6-1,  6-3,  6-4,  to  6-10  give  the  1000  wave  arrays  for  the  nine 
ten  state  Markov  processes  not  given  in  the  previous  tables.  The  number 
after  the  dash  corresponds  to  the  number  of  the  1000  wave  sample.  There 
are  11,  and  the  missing  6-2  and  6-11  are  given  in  4-C-2-1  and  5-C-ll-l. 

The  trend  for  the  high  counts  to  concentrate  along  the  main  diagonal  toward 
the  end  of  the  total  record  can  be  seen  in  the  data. 


TABLE  4-6  One  Step  Transition  Counts  and  Sample  Markov  Transition 
Matrix  for  Record  6. 
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TABLE  4-7  One  Step  Transition  Counts  and  Sample  Markov  Transition 
Matrix  for  Record  7. 
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TABLE  4-8  One  Step  Transition  Counts  and  Sample  Markov 
Transition  Matrix  for  Record  8. 
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TABLE  4-9  One  Step  Transition  Counts  and  Sample  Markov 
Transition  Matrix  for  Record  9. 
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TABLE  4-10  One  Step  Transition  Counts  and  Sample  Markov 
Transition  Matrix  for  Record  10. 


TABLE  4-C-2-1 , 


Summed  Transition  Counts  for  Records  6  to  10  and 
the  Difference  Between  the  Values  of  Ten  and  the 
Actual  Value. 
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TABLE  4-C-2-2.  Transition  Probability  Matrix  for  1000  Waves  and 

the  Difference  Between  it  and  0.10  for  each  Element 
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TABLE  4-C-2-4 .  The  Cube  of  the  Matrix  in  4-C-2-2  and  the  corresponding 
Differences . 
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0,1  00 
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O, 10P I 
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O. 099 
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P. 100 


priip  matrix  -  o.ioo 
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?  *  ,f *00-3.001 -0,001  0,001. 

1  *  '*,000  0.000-0.001-0.000. 
H  0,000-P. 001-0, 001. 

<5  *--,/000  0.000-0.001-0.000, 

h  *  0fooo  0.000-0.001-0.000. 
7  *  0/000  0.000-0.001-11,000. 

«  *--/ 703-0, 000-3, 001-0, pot, 
9  *  '*'-*00  0.000-0.001  0,001- 
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■P  .  00  1 « 
•  0  •  000* 
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■0 , 000* 
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■0 , 001 « 
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0,000. 
0 . 000. 
0.000 
■P.000, 
0,001 
0.000. 
0,000 
0,001. 
0,000 


0,000 
0.000 
0’.O00 
0,000 
■0  •  000 
0,000 
■0.000 
0.000 
■  0.000 
0.000 


? 

*  *  *  *  r 


TABLE  4-C22-1  Slimmed  Two  Step  Transition  Counts  for  Records 
6  to  10  and  the  Differences  Between  the  Values 
of  10  and  the  Actual  Values. 


?  step  COUNTS  1000  WAVPS 
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13 
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ft 

9 

S 

1  4 

9 

17 

* 

100 

9 

* 
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IP 
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13 

u 
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IP 

15 

* 
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0 

* 

9 

7 
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A 

ft 
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15 
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* 
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TABLE  4-C22-2  Sample  Two  Step  Transition  Probabilities  for 
Records  6  to  10  and  the  Departure  from  0.10. 


2  STEP  PROBABILITY  1000  WAVES 


t  *  *  i 

1  *  i 

1 

k  *  ^  4  *  *  4 

2 

3 

4 

5 

A 

7 

8 

9 

1  3 

i 

* 

A*  OA0 

0,120 

B.3A0 

7  B  4  H  R  4  wf 

B  .  1 23 

3,070 

■  W  W  W  B  ®  U 

0.1  20 

H  H  W  ^  M 

0.113 

^  ft  ft  A  ft  ft  J 

0.123 

0.113 

r  *  *  *  *  # 

0.090 

? 

* 

r«  1  20 

0.000 

0.050 

B  ,  1  03 

3. 1  00 

B.  110 

B.  140 

0,080 

0.080 

0.370 

* 

0*110 

0.000 

0.140 

0.150 

3.080 

B.  1  1  0 

O.09B 

0  .  BAB 

0,073 

0.110 

a 

* 

0  1  40 

0.110 

B  ,  BAB 

B.B53 

3.1  AB 

0.1BB 

0.080 
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0.113 

0.090 

5 

* 

■v  1  1  B 

0. 1  2B 

0.040 

3.070 

3.090 
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0.093 

0.0A0 

0.110 
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K 

* 

V  150 
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0,1«B 

3.100 

3,120 

0.  IBB 

B.110 

3.010 
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0,0A0 

7 

* 

o'  08B 

3,130 

0.040 

0.120 

3,150 

0. 1  00 

0.113 

0.070 

9  0^  Pt 
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<i 

* 

V  -«50 

0.170 

3.110 

0.130 

0.38B 

B  .  050 

0.353 

B.140 

0.090 
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Q 

* 

3*  TQf/| 
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B.05O 

3.103 

3.060 

M.  1  3B 

3.130 
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1 0 

it 

•»'  7ld0 

3.070 

B.1  3B 

B.3A0 

3.080 

B.340 

0.110 
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PR  OH 

1  MATRIX  -  B. 
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3 

4 

5 

7 

8 

9 

10 

i 

*-B*  r/i40 

0 .020- 

1  t  A  A  ft  t  ft 

B.020- 

it  t5?  it  *  #  i 

'3.010 

3 .320 

3.210 

B.020 

i  ****** 
0.010- 

it  *  *  *  it 

.0,310 

2 

* 

0  P20-B.O10- 

.3,010 

0.000 

0.000 

3.010 

B.M4 0-0,020-0.020- 

.0.310 

1  *  fl-B.noo  O.340  tA.t95B-JI.BPO  0 . 0 1  0-0 , 3 1  0-0 . 04(9-0 . 030  0.313 

a  *  0*340  0.010-0.040-0.050  3^,360  3.000-0,023  0,000  0,01 0-0.01  3 

*;  *  o'mB  B.  o?B-B.Bin-B.  3l0-0,0l  0  0,320-0.310-0.040  0 .030  0,020 

h  *  0*fl> 53-0,050  0.380  0.000  3,020  b’.bbb  B.B30-B. 070-0,020-0^040 

7  *-</'>2fl  3. 010.0. BIB  0,020  0.050  0 . BOB  0 . 0 1 0-0 . 030-0 , 040-0 . 0 1 0 

8  *-3*?50  B.B30  0.010  0.030-0', 020-0.050-0.053  0.040.0.010  0,070 
4  *-3*010  B.000-0.B5O  0.000-0.040  0,000  M.000  0,050  B.000  0,050 

IP  *-3*710-51.013  B. PI3P-0, 043-0. b20-b'.01  0  0.013  0.070  0.050-0.050 


TABLE  5-51  One  Step  Transition  Counts  and  Sample  Markov 
Transition  Matrix  for  Record  51. 


1  STEP  COUNTS  200  WAVES 


HMAX 


*  *  *  *  i 
1  * 


to  35.30  S«.<>«  11 


1  0  * 


2"i  po  ?  ti  31  3*  Pn  10  ?(*  ppi  pm 


1  STEP  PRfJSAHlLTTV 


O.05 
0.35 
*."5 
0.00 
0  .  MS 
0 . 1  0 
O.20 
0.05 

o.  i  o 
o.  1  0 


C.  Id  Pi, 
0  .  I 5  Pi , 
"1.291  PI, 
Pl.pO  0, 
«.05  O, 
fl.MS  O, 
P.15  Cl, 
w.05  S', 
0.05  H, 

0,00  t/l  , 


0.10 
0.20 
M.  10 

0. IS 
0.15 
0.10 
0.10 
0.10 
0.00 
0.00 


0.191 
PI.  05 
0,10 
0.10 
0.05 
0.25 
o.  is 

0,15 

0.05 

0.05 


O.ps 
0  .  0  0 
0.10 
0.00 
o.o5 

0.15 
0.15 
0.15 
0  .  ?o 

0.15 


o.15 
0.05 
0.20 
o.l  3 
o.?0 
0.05 
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o.o  o 
0.05 
0.1  5 


0.20 

0,00 

0,05 

0,15 

0,20 

0,05 
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0.15 

0,05 

0,05 


o.  1  PI 

0.05 
0.00 
0.05 
01.  1  in 

0.15 

O.05 

0.10 

0.20 

o.?0 


0.05 

0.05 

0.00 

0,05 

0.10 

0.05 

0.05 

0.15 

0.20 

0.30 


'•TV 


TABLE  5-54  One  Step  Transition  Counts  and  Sample  Markov 
Transition  Matrix  for  Record  54. 


1  STEP  COUNTS  POO  WAVES 


******* 


l  STEP  PROBABILITY 
4  5  A  7 


1  *  0.20  7, 

P  *  0.17  7.05  o, 

5  *  « .  m«5  0.75  0, 

4  *  0.17  o.po  o, 

5  *  0.P5  O.io  o, 

%  *  7.  If.’  7 .  MS  7, 

7  *  0.15  7.17  7, 

S  *  /A.  75  11.77  pi, 

9  *  PI.7S  7.75  7, 

17  *  7.75  7.27  7, 


7.75 
7.25 
7.7  5 

7.75 

7.10 

7.10 

7.10 
55.  ?5 

7.75 
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0.25 

0.75 
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0.15 
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0.05 
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0,15 
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0.05 
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0.05 

0.25 


0.05 

0.77 
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0.05 
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0.10 

o.?o 

0.10 

0.10 


0.05 

7.00 

0.15 

0.77 

0.05 

7.05 
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0.20 

0.20 
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TABLE  5-55  One  Step  Transition  Counts  and  Sample  Markov 
Transition  Matrix  for  Record  55. 
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TABLE  5-C-ll-l  Summed  Transition  Counts  for  Records  51  to  55  and  the 
Difference  Between  the  Value  of  Ten  and  the  Actual 
Value. 
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TABLE  5-C-11-2  Transition  Probability  Matrix  for  1000  Waves  and  the 
Difference  Between  it  and  0.10  for  each  Element. 


1  STEP  PPOPAfULITY  1  Mr*  WAVES 


"i.nn 
0  .  080 
0.  JTn 
i?.  1  8p 

o .  i  n  o 
o .  o  a  p 
0 , 1  o  o 

o ,  o  4  o 

f .  o  8  o 

’l.r'hfl 


MW 

51.  1  *50 
0. 1  50 
0.  ISO 
0.080 
0.110 
0.0  70 
p,08n 

0.08O 
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0  .  0  7  0 
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0.130 
0. 1  35} 
0.  ISO 
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M.  100 
0.130 

o. oao 

p. 040 
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O.jafl 
0.080 
■0  .  o  7  n 
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0.100 
O.100 
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0  0  8  0 
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0 . 050 
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0 , 0  9  0 
0.130 
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0. 1  50 
0.080 


0 . 1  0  0 
0.1  30 
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0.090 
0.110 
*.130 
0 ,090 
0,070 
0,080 
0.1  00 


0.080 
(1.O80 
0 ,090 
0,090 
0.110 
O  .  1  0  0 
0,1*50 
O.O90 
0.100 
0.130 


0.070 
0,09  0 
O.O80 
0.040 
0.080 
0,180 
0.100 
0.140 
0.  J  40 
o.lSO 


0.030 

0.OU0 

0.080 

0.09/ 

0.070 

0.07O 

0.080 

0.150 

0.200 

0.250 


P9UH  MATPIK  -  o.tOO 


★  *  *  * 

1  *  ''_?<(«  0,o*o  0,030-0,030  o.o30  0.030  «  .  0071-0  .  W30-O ,  0  30-0 . 0RO 
p  *  ■’*•'70-0.030  0.050  0.0.30  0.030-0.030  0.030-0,040-0,080-0.080 
*  *-/'?R0  0.07  o  0.030  0,030  0.040-0.050  0 . 00 /)-o  .  0  1  0-o  ,  030-0 . 030 
■i  *  -  yi '  -  p  W  0.080  0.080  0,030-0.030  0.010-0,010-0.010-0.080-0.080 
5  *  0*  *50  0.040-0. 040  0.050-0.030-0.020  0.013  0.010-0.040-0,030 
-  *-(/  030-0. 080  0.010  0,000  0.010-0. 010  0.030  O.00PI  0.080-0,030 
7  0.000-0.030  O.OOO  O.oOO  0.030-0,010  0.050  0.000-0,020 
A  *  '•*  -100.0. O80-0. 030  0,030  0.000  0.01 0-0, 032-0,010  0,040  0.050 
5  *  '»00-:i.O4M-r,.  040-0,  '*80-0.030  0.050-0.020  0,000  0,040  0,100 
i  *-0' ••40-0.090-0. 050-11. 080-0. 020-0. 020  0.000  0,030  0.050  0,150 


TABLE  5-C-11-3  The  Square  of  the  Matrix  in  5-C-11-2  and  the  Difference 
Between  it  and  0.10. 


PRQB  MATRIX  SmiAKED 


1  *  7'  1*6  (A.  1*5  p. 107  0,107  PI. IPS  0.798  P.1P4  0,096  7.093  0,078 
?  *  *'lPt  *.118  0.1*9  *.1*7  7.175  0.099  O.JQI  0,099  *.*87  7.075 
3  *  'I*  10«>  7.J07  M.IP4  0.109  1PP  OI.09S  0.1*3  P.098  P.78S  0.094 
a  *  *'l*3  0.1*8  0.114  P.lll  7.1*8  0.M93  0.1»4  P.094  0,089  P.P79 
S  *  •,'/1P8  0.1P4  P ,  IP9  P ,  1 0 i  0.1*3  0.1*3  7.1*0  0,097  7. *91  0.085 
s  *  */09s  0.100  0.093  0.097  0.09S  0.1PS  7  .  *  9  6  0,104  0,1*5  0.1*9 
7  *  -'iCV  P.0O4  0.099  0,1*2  *.099  0.1*?  p.099  0,099  0.104  0.1*1 
*  *  *'*95  0.098  P.095  0,093  0.095  0.1*3  0.098  0,103  *,1*7  0.114 
o  *  *'*94  0.P83  0 .089  0.088  71.097  0.1*3  *.100  0.1*4  *,1  19  0.136 
"  *  -/*93  *.*R?  0,083  P.085  0.093  0.1*3  *.*97  0.1*8  *,118  0.140 
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TABLE  5-C-11-4  The  Cube  of  the  Matrix  in  5-  C-ll-2  and  the 
Corresponding  Differences. 


PKd'l  MATPtX  Cl'PED 

'  2  3  4  5  8  7  A  9  10 

1  *  'a'icti  0.103  0.102  0.102  0.101  0.100  0,100  0.P99  0.P197  0,09a 

?  *  r?'  1  ci 2  0.103  0.103  0.103  0.101  0.099  0.101  0.098  0.098  0.093 

1  *  0*1^1  0.102  0.1 0?  0.101  0.101  3.09«>  0,10(3  0,099  0,097  3.398 

u  «  '’'llM  0.10a  0.103  0,10j  0.101  0.099  0.100  0,099  0,09ft  0.094 

S  *  0r1«l  0.102  0.102  0.102  51.1(11  0.099  0.100  0.099  3.099  0.095 

*  *  'i'  1  >00  0.398  0.099  0.099  3,100  0.100  0.100  0,100  0.10?  0.103 

7  *  0*1  00  0.100  0,100  0,100  3,100  0.100  0.1M0  0,100  0.100  0,  100 

n  *  '/-(lO  ,7|.a9R  0.H9A  0,099  0.099  0.100  0.100  0.101  0,102  0.109 

9  *  >i'  ->98  0.00*,  0.09(<,  0,098  0.099  0.101  0.099  0.10?  0,105  0.109 

1  '•  *  '*'c-9fl  0.095  0.095  0.095  3.098  0.101  0.099  0.102  0.J08  0.111 


P50H  MAT9IX  -  H.100 

1  2  3  9  5  9  7  8  910 

i  *  •''>01  0.003  0.002  0.002  3^001-0.000  0.002-0.001-0.003-0.009 

?  «  P.093  0.003  0.003  2,001-0.001  0.001-0.002-0.009-0.007 

3  *  V-01  0.00?  0.002  0.001  7i.00i-0.00i  0.000-0. 001-0. n03-0. 00a 

a  *  :/>0i  M.pwa  0 .003  0.00  5  0.001-0.001  0.002-0,001-0.004-0.008 

■>  *  0,W01  0.002  0,002  0.002  2.001-0.001  0.002-0,001-0.002-0.005 

fc  *-0<’>00-0. 002-0. 001-0. am-0, 000  0.000  —  f3. 202  0.000  0.002  0.003 
7  *-3,*00  0. 0 00-0, 00U  —  0,000-3.000  0.300-0.300  0,003  0,000  0.000 
M  4- A* 001-3, 002-0.002-0, 032-0. 001  0.300-0.200  0.001  0,002  0.004 
9  *-'/  *02-0. 304-0. 304-3, 009-3. 002  0.001-0.031  0,032  0.005  0,038 
1  i  *-0^002-0.095-0.335-0,005-2.002  0.001-0.031  9.002  0.038  0.011 
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TABLE  5-C112-1  Summed  Two  Step  Transition  Counts  for  Records 
51  to  55  and  the  Difference  Between  the  Value 
of  10  and  the  Actual  Value. 
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TABLE  5-C112-2  Sample  Two  Step  Transition  Probabilities  for 
Records  51  to  55  and  the  Departure  from  0.10. 
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TABLE  5-C113-1  Summed  Three  Step  Transition  Counts  for  Records 
51  to  55  and  the  Difference  Between  the  Value 
of  10  and  the  Actual  Value. 
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TABLE  5-C113-2  Sample  Three  Step  Transition  Probabilities  for 
Records  51  to  55  and  Departure  from  0.10. 


3  STEP  PROBABILITY  1  030  WAVES 


1  *  UP  a.)?*  0.t5"  0.070  0„0AO  pi. IIP  w.ttfl  0.110  0.O50  0.090 
J  k  1  00  O.090  0.040  0.100  P.l  2P  0.130  PI. 083  0,113  R.t20 

3  *  «'?90  c? . i sp?  o.i?o  0.103  0.110  0.0*0  »,H3  0,090  p.pAS)  0 . pop 

a  *  '  imp  0.120  0.09O  0,080  0.120  0.110  0.062  0.070  P.130  3.120 

5  *  ’*123  0.100  0,050  0,050  7.150  0.120  0.140  0,090  0.1't0  0.040 

t  *  o' 3  93  0.05P  3.083  0,130  0.070  0.0P0  0.130  0.130  0,090  0.143 

7  *  •if’.<S0  0.050  0.100  0.17  0  0.1O0  0.0 AP  0.07  0  0,110  P.140  0.090 

«  *  '-'*790  0.1P0  0.070  0,120  0.120  0.09M  0.100  0,110  0.100  0.100 

?  *  -'*120  0.O70  0.120  0.110  0.0A0  0.100  0.120  0.130  0.060  0.090 

10  *  '/no  0,100  0.130  0.130  0.070  0.110  0.030  0,090  O.110  0.120 


PROA  MATRIX  -  0.100 


1  *  0*010  0.020  0.050-0.1130-0.020  0.010  0.010  0 . 0 1  0-0 , 050-0 . o i  o 

2  *-3*723  0.030-0.010-0,060  0.000  0.020  0,030-0.020  0.O10  0.020 

3  *. 0**010  0.050  0.020  0,000  0.010-0.040  0.013-0.010-0,020-0.010 

4  *  3*730  0.02O-0.01 0-0,020  0.020  O.oi «— 1, 043  —  0. 030  0,030  0.020 

5  *  0*720  0.000-0.050-0,050  0.050  0.020  0.040-0.010  0,040-0,050 

h  *-0*710-0.05 O-0.020  0. 030-0. 030-0. Ol 0  0,030  0.030-0,010  0.O4O 
7  *-vl *?2P-0.m 40  0.000  0.070  0.000-0.020-0.030  0,010  0.040-0,010 
m  *.7*71(7  0.000-0.030  0,(120  O.020-0.O1O  3,003  0,010  0.0PO  0,000 
9  •  7*  720  —  0 . 030  0.020  0.010-0.020  0.000  O.02O  0.030-0.040.0.010 

0  *  0*010  0.P00  0.030  0,030-0.1030  0,010-0.070-0.010  0.010  3.020 


TABLE  5-C113-3  The  Square  of  the  Matrix  in  Table  5C113-2  and 
the  Differences  Between  it  and  0.10. 


PWOfl  MATRIX  SOIJAHFO 
q  s  h  7 


10 


*  *  ********1 


1 

* 

O'  ORf, 

0.103 

0.101 

0.101 

P.  1  0P» 

0.09ft 

0.101 

0.100 

0.100 

0.101 

p 

* 

o'  p98 

0.09ft 

0.100 

0.103 

■  A  ,  Pi  9  7 

0.099 

0.102 

0.103 

0.102 

0.101 

1 

* 

o'  f, 9ft 

£ 

0.1  04 

0.099 

•1 .  0  9  ft 

0.103 

0.100 

0.101 

0.098 

0.104 

0.099 

■i 

* 

v  1  01 

0. 1  Op 

o ,  ioi 

0,09b 

Pl.p9ft 

0.101 

0.103 

0.101 

0,099 

0.100 

s 

* 

Y  i  00 

0,093 

0.097 

P.1O0 

0.101 

0.100 

0.108 

0.105 

0.101 

0.095 

*1 

* 

Y  099 
» 

0,09ft 

0.10! 

0.108 

0.099 

0.09ft 

0.092 

0,101 

0.103 

0.102 

r 

* 

0  101 

0.101 

0 . 099 

0.099 

0,102 

0.099 

O.098 

0,100 

0.101 

0.100 

•i 

♦ 

o'  i  00 

o'  oRfl 

0.099 

0.098 

0.099 

0.102 

0.101 

0,099 

0.100 

0, 104 

0.099 

9 

* 

0.101 

0.101 

0.103 

0.101 

0.097 

0.098 

0,101 

0.101 

0.100 

0 

* 

/'  1  op 

0. 104 

O  .10  3 

0.09  7 

0*.  098 

0.099 

0.099 

0.100 

0.09ft 

0.103 

1 

********* 

l  *.,''  ?04 

*  .  i *  '*  n  f> 
*-o,  /  -/P2 

*  Y  '.'<A\ 

*  Y  "00 

*  .  P  '  V  0  1 

* 


P 

3 

a 

5 

h 

t 

k 

9 

1  ’’ 


*-o'  ?t/ip 
*-i'»f  r»(7*p 
*  ^'r»M0 


■3.003 

•  PI,  HMO 

O.opq. 

■o.ono 

•p . 007 . 
■0.00? 
0  .  00  1 . 
■  0  .  0  0  1  " 
o.ooi 
O.004 


O.OOl 

0.0013 

,01.  VI  fll 

PI.  PI  CM' 

■  pi.003' 
0.  Pi  Ml 

■  Pi. pi  PM  ■ 
.Pi.flCia, 
0.001 
0.003 


PROW  MATRIX  -  0.100 

q  *5  ft  7 

********* 

0,000 


o.noi 

0,003 
•o.ooa 
■0.0134 
■n.030 
Pi. 008 
.0.001 
•  pi,  001 
0.003 
■0,003 


■  0,003, 
0.003 
■0.002 
0,001 
■0.001 
0.002 
0.002 
0.001 
■0.002 


■o.0oa 

■0.001 

■0.000 

0.001 

0 . 000 
■  0.002" 
■  0.001* 
0.001. 
■0  .  D03" 
■  0.001" 


0.001 

O.002 

0.001' 

0,003 

0.008 

0.008 

0.002 

0.001 

0,002 

0.001 


0.000 
0.003 
■0.002 
0.001* 
0.005 
0.001 
0.000 
0,000 
0.001 
■0 , 000* 


0.000 

0.002 

0.004- 

0,001* 

0.001" 

0.003 

0.001" 

0,004- 

0.001 

,0.003 


10 
,**< 
0.001 
0.001 
0.001 
0.000 
,0.005 
0,002 
,0.000 
,0.001 
0,000 
0.003 


112 


TABLE  6-4  One  Step  Counts  for  the  1000  Wave  Sample  from  Records 
16  to  20  and  the  Difference  Array. 
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TABLE  6-6  One  Step  Counts  for  the  1000  Wave  Sample  from  Records  26 
to  30  and  the  Difference  Array. 
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TABLE  6-7  One  Step  Counts  for  the  1000  Wave  Sample  from  Records 
31  to  35  and  the  Difference  Array. 
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TABLE  6-8  One  Step  Counts  for  the  1000  Wave  Sample  from  Records 
36  to  40  and  the  Difference  Array. 
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These  ten  by  ten  transition  matrices  show  that  it  is  virtually  impossible 
to  predict  the  height  of  the  next  wave  as  a  member  of  one  of  ten  sets  given 
the  height  of  a  wave  that  has  just  passed  as  a  member  of  one  of  ten  sets. 

The  element  of  surprize  common  to  most  accounts  of  extreme  wave  at  sea  is 
more  understandable  on  the  basis  of  this  kind  of  analysis.  In  even  a 
statistically  unchanging  seaway,  the  time  and  place  that  an  extreme  wave  will 
occur  is  essentially  unpredictable.  It  can  form  itself  a  few  tens  to  hundreds 
of  meters  away  from  a  ship  and  not  have  existed  at  all  a  few  minutes  earlier. 

To  proceed  further,  the  number  of  states  were  reduced  to  three  by  combining 
probabilities  (or  counts)  in  states  1  to  3,  states  4  to  7  and  states  8  to  10, 
which  correspond  to  the  lowest  30%,  the  middle  40%  and  the  highest  30%  of  the 
waves.  The  result  is  a  three  by  three  transition  matrix  as  shown  in  the 
first  column  of  matrices  in  Table  7.  The  upper  left  number  represents  the 
transition  of  states  1  to  3,  the  lowest  30%,  to  states  1  to  3,  again  the 
lowest  30%.  The  next  value  in  the  top  row  is  the  transition  probability 

from  states  1  to  3  to  states  4  to  7;  that  is  from  the  lowest  30%  to  the 

middle  40%.  The  last  value  in  the  top  row  is  the  transition  probability  from 

the  lowest  30%  to  the  highest  30%. 

The  next  row  of  each  matrix  shows  the  transition  probabilities  of  the 
middle  40%  to  the  lowest  30%,  the  middle  40%  and  the  highest  30%,  and  the 
last  row  is  for  the  transition  probabilities  for  the  highest  30%  to  the 

lowest  30%,  the  middle  40%  and  the  highest  30%.  If  the  waves  were  completely 

unpredictable,  the  values  in  each  row  would  be  very  close  to  0.3,  0.4  and 
0.3  for  a  large  enough  sample.  The  comer  elements  in  these  eleven  matrices 
are  the  most  interesting.  These  sample  transition  probabilities  are  graphed 
in  Fig.  19,  still  with  the  hypothesis  that  wave  height  sequences  are  a  one 
step  Markov  process.  As  in  both  the  Figure  and  Table  7,  for  the  first 
sample,  the  probability  that  if  the  wave  that  is  present  is  among  the  30% 
lowest,  the  following  wave  will  be  among  the  30%  lowest  is  0.390.  The 
probability  that  a  wave  among  the  30%  lowest  will  be  followed  by  a  wave  among 
the  30%  highest  is  0,233.  Similar  interpretations  are  possible  for  each  of 
the  four  transition  probabilities  illustrated  in  the  figure.  For  samples  2 
and  3  there  is  not  much  difference  between  the  sample  probabilities  and  0.3. 

As  the  hurricane  waves  build  in  height,  the  probabilities  that  a  high  wave 
will  be  followed  by  a  high  wave  and  a  low  wave  by  a  low  wave  increase  to 


TABLE  7  Three  State  One  Step  Markov  Transition  Matrices  for  the  Lowest 
30%,  the  Middle  40%  and  the  Highest  30%  and  the  Corresponding 
Theoretical  Two  Step  and  Actual  Two  Step  Matrices. 
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FIGURE  19  Selected  Transition  Probabilities  for  an  Assumed  Three 
State  Markov  Process. 


values  of  0.450  and  0.380.  The  probabilities  that  either  a  low  wave  will 
be  followed  by  a  high  wave  or  a  high  wave  will  be  followed  by  a  low  wave 
change  from  near  0.3  (essentially  random)  to  values  of  0.187  and  0.203. 

Of  1000  waves,  300  are  the  lowest  30%  and  300  are  the  highest  30%.  If 
the  lower  waves  had  a  probability  of  0.3,  1000  waves  are  sampled,  and  if 
the  waves  were  random  the  expected  number  of  transition  lower  waves  would 

j*  i, 

be  300  with  a  standard  deviation  of  (0.3  X  0.7  X  1000)^  (or  (210)”7) . 

u 

Sample  probabilities  of  (300  ±  p(210)  )/1000,  since  estimated  probabilities 
have  sampling  variability,  are  shown  on  the  transition  probability  scale  as 
plus  or  minus  one  to  three  standard  deviations.  Toward  the  end  of  the  full 
data  set,  as  the  hurricane  waves  build  in  height,  their  heights  clearly 
begin  to  depend  upon,  at  least,  the  height  of  the  preceding  wave. 

The  square  of  a  one  step  transition  matrix  should  give  the  two  step 
transition  matrix.  These  squared  matrices  are  also  shown  in  Table  17.  The 
transition  probabilities  are  also  graphed  in  Fig.  19  for  the  two  step 
process  as  labled.  Probabilities  of  0.318,  0.335,  0.273  and  0.277  are  so 
little  different  from  0.300  that  if  one  knew  the  height  of  a  passing 
wave  the  second  wave  to  follow  could  not  be  predicted  better  than  by  chance. 

The  actual  two  step  transitions  are  also  tabulated  and  graphed.  Most 
of  the  four  comer  values  are  within  one  standard  deviation  and  all  but 
two  out  of  forty  four  are  within  two  standard  deviations.  Knowledge  of 
the  height  of  a  passing  wave  is  of  little  value  in  predicting  the  height 
of  the  second  wave  to  follow  it. 

It  is  also  possible  to  simplify  the  prediction  but  base  it  more 
accurately  on  the  heights  of  the  initial  wave.  As  in  Figure  20,  given  that 
the  initial  wave  is  in  state  1  (that  is  one  of  the  lowest  10%)  the  sample 
probability  that  the  next  wave  will  be  states  1  to  3  or  8  to  10  can  be 
found.  If  a  very  low  wave  has  passed, the  chance  that  the  next  wave  will 
be  among  the  30%  lowest  exceeds  0.4  and  that  it  will  be  among  the  30% 
highest  is  under  0.2.  Similarly  if  a  very  high  wave  has  passed,  it  is 
better  than  even  the  next  wave  will  be  among  the  30%  highest  and  under  0.2 
that  it  will  be  among  the  30%  lowest. 

Even  1000  wave  samples  are  somewhat  irregular  as  in  Fig.  19.  Nine 
overlapping  3000  wave  samples  can  be  formed  by  averaging  the  sample 
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b  TRANSITION  PROBABILITY 


IOOO  WAVE  SAMPLE  NUMBER 


FIGURE  20  Sample  Transition  Probabilities  from  States  1  and  10  to 
States  1  to  3  and  States  8  to  10. 


probabilities  by  three  in  Table  7.  For  the  four  comers  of  the  matrices  that 
result  the  appropriate  transition  probabilities  for  a  one  step  Markov  and  for 
the  actual  two  step  transition  are  shown  in  Fig.  21. 

Whether  or  not  a  Markov  process  adequately  represents  the  sequence  of 
successive  crest  to  trough  heights  during  Camille  is  difficult  to  judge. 

There  are  other  ways  to  predict  the  evolution  of  a  time  series  that  depend  on 
more  of  the  past  behavior  of  the  waves  than  on  the  height  of  the  wave  that 
just  passed.  For  the  present  purpose,  however,  the  analysis  shows  that  the 
second  wave  to  follow  a  particular  wave  is  nearly  independent  of  the  height 
of  that  wave  and  that  the  height  of  the  first  wave  to  follow  a  particular 
wave  is  not  random  in  the  sense  required  for  a  random  sample. 

The  two  hundred  wave  samples  that  are  the  basis  for  the  analysis  of  the 
Camille  wave  data  are  not  the  equivalent  of  200  independent  samples  from  a 
pdf  that  would  be  definable  in  terms  of  either  statistics  of  the  sample  or  the 
spectrum  of  the  wave  in  terms  of  moments  of  the  spectrum.  The  equivalent 
number  of  independent  wave  heights  is  surely  less  than  200,  at  least  toward 
the  end  of  the  55  sets  of  data.  It  may  be  somewhat  larger  than  100  because 
the  dependence  of  the  next  wave  on  the  previous  wave  is  weak.  Since  these 
sample  transition  probabilities  are  a  function  of  sample  number,  the  effective 
number  of  independent  observations  changes  from  one  sample  to  the  next  and 
may  decrease  as  the  sample  number  goes  from  1  to  55.  The  lack  of  independence 
will  have  further  repercussions  in  attempting  to  interpret  the  data. 
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SAMPLE  CUMULATIVE  DENSITY  FUNCTIONS 


Introduction.  One  of  the  ways  to  display  a  sample  of  crest  to  trough 
ocean  wave  heights  is  by  means  of  the  sample  cumulative  density  function,  or 
a  variation  thereof.  A  plot  of  the  probability  that  h  will  exceed  a  given 
value  on  a  log  scale,  labled  in  antilog  values,  as  a  function  of  h  on  a 

linear  scale  is  often  used,  frequently  in  normalized  form  as  in  Fig.  18. 

2 

If  the  horizontal  scale  of  the  plot  is  changed  to  h  instead  of  h,  the 
theoretical  Rayleigh  cdf  becomes  a  straight  line. 

One  might  think  that  either  of  these  two  ways  to  display  the  data  from 
a  sample  provide  useful  information  about  the  high  wave  tail  of  the  distri¬ 
bution.  At  h  equal  to  zero  the  data  and  the  theory  start  out  at  one.  The 
first  90%  of  the  data  occupy  the  first  decade  of  the  probability  scale;  the 
next  9%  occupy  the  second  decade;  the  next  0.9%  the  third  decade;  and  next 
0.09%,  the  fourth,  and  so  on. 

For  a  200  wave  sample,  the  highest  wave  falls  at  the  anti log  point  0.005. 
It  is  difficult  to  judge  how  well  the  first  90%  of  the  data  fit  the  theory 
because  of  the  distortion  of  the  scales.  The  points  for  such  plots  usually 
fall  well  below  the  theoretical  transformed  Rayleigh  shape  with  actual 
sample  exceedance  probabilities  often  a  factor  of  two  or  more  smaller  than 
the  theoretical  for  the  last  few  points  of  the  plot.  It  is  then  tempting 
to  conclude  that  the  Rayleigh  distribution  does  not  fit  the  data  well  for 
high  waves.  This  was  the  conclusion  reached  in  our  previously  cited  unpub¬ 
lished  study  for  DTNSRDC .  Such  a  conclusion  is  no  longer  obvious,  as  the 
following  analysis  will  show,  because  of  certain  inherent  properties  of 
sample  cdf's  and  because  the  sample  of  wave  heights  is  not  a  true  random 
sample . 


Properties  of  Sample  CDF’s.  A  sample  cdf,  Fn(x),  is  defined  to  be 


F  (x)  =  —  (number  of  X.  less  than  or  equal  to  x) 
n  n  l 


(107) 


where  X.,  X_, . X  is  a  random  sample.  The  function,  F  (x),  assumes  the 

x  £  ti  n 

values  0,  1/n. . .2/n. . .n/n  and  can  be  graphed  in  a  stair  step  pattern. 

Under  these  conditions  it  can  be  proved  that 


P  Fn(x)  =  |  =  [|j)(F(x))k  1  -  pw]n  ’  K 


for  k  =  0,  1 _ n  (108) 
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If  one  knew  the  population  cdf,  F(x),  then  in  principle  Fn(x)  for  a 
particular  x  can  have  any  of  the  above  fractions  as  values  from  zero  to 
one.  It  can  also  be  shown  that 


e  (Fn(x)J  =  F (x) 


and  Var(F  (x))  =  i-  F(x)  [l  -  F(x)] 


(109) 


(110) 


Everything  appears  to  be  straight  forward  and  clear  from  the  sampling 
theory  point  of  view,  but  the  actual  situation  is  deceptive  for  typical 
sample  sizes  over  the  full  range  of  definition  of  Fn(x). 

An  Analysis  of  the  Consequences  of  the  Properties  of  Sample  CDF's 
Although  the  following  analysis  applies  to  any  cdf,  it  will  be  applied  to 
wave  heights  for  illustration.  Assume  that  wave  heights  are  in  fact 
Rayleigh  distributed  and  that  Eq  is  exactly  known.  Then  a  normalized  variable 
given  by 


X  =  h/2(Eo)  2 

would  have  the  cdf  given  by 


(111) 


F (x)  =  1  -  exp(_x2/2) 


(112) 


Suppose  that  we  have  a  true  random  simple  of  200  values  of  X^.  Also  to 
simplify  analysis,  let  x  Q05  be  the  value  of  x  in  (112)  that  makes  F(x) 
equal  to  0.005,  that  is 


x  oos  =  “2  £l(l  ~  -005)  (113) 

the  numbers, 

X  .005  X.010  X  .015 -  Xp -  X0995 

can  then  be  computed.  For  each  value  of  the  X^ 


if 

0  <  Xi  —  X  .005 

let  X^  equal  x  numerically. 

if 

x.005  <  Xi  -  X.010 

let  equal  x  numerically 

if 

x  ,  <  X.  <  x 
p-1  i  -  p 

let  X^  equal  x^  numerically. 

up  to  x  ggj.  on  the  right  hand  side  of  the  inequalities  and  if 
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x  nnr  <  X.  <  00  let  X.  equal  its  actual  value. 

.995  1  l  M 

The  odds  are  virtually  inf initessimal ,  (200!  )/(200)  that  there  will  be 

exactly  one  value  of  X^  in  each  of  the  200  intervals  defined  above,  and 
consequently  the  sample  cdf  will  depart  in  some  way  from  the  population  cdf. 


Figure  22  shows  what  can  happen  for  low  values  of  X^,  or  as  transformed, 

x  .  The  circled  points  are  the  first  12  values  of  x  .  The  numbers  are 
P  P 

probabilities  times  1000  from  (108)  for  x  =  x^ .  If  one  had  the  luxury  of 

1000  random  samples  of  200,  for  about  367  samples  there  would  be  no  x  in 

200  P 

the  sample  for  x  qq  (numerically  ((0.995)  )  x  1000).  There  would  be  369 

jumps  to  F  (x)  =  0.005,  there  would  be  182  jumps  to  F^(x)  =  0.010,  62  to 
0.015,  15  to  0.020  and  3  to  0.025.  Numbers  have  been  rounded  to  the  nearest 
integer  and  adjusted  to  sum  to  1000.  Although  theoretically  possible,  there 
would  be  less  than  1  jump  in  2000  for  any  value  greater  than  0.025.  For  736 
times  in  1000  the  sample  cdf  will  be  less  than  or  equal  to  the  population  cdf 
for  the  first  circled  point. 


For  the  second  circled  point,  for  134  times  there  will  still  be  no  jump 
and  so  on  reading  the  values  for  increasing  F(x).  For  405  times,  the  sample 
cdf  will  be  below  the  circled  point.  For  677  times  it  will  be  at  or  below 
the  circled  point.  For  only  323  times  will  Fn(x)  exceed  F(x). 

For  the  next  ten  points  for  a  total  of  12,  where  F(x)  reaches  0.060,  the 
sample  cdf  has  a  greater  probability  of  being  below  the  theoretical  value  than 
it  has  of  being  above  it. 


•  A  given  sample  cdf  can,  in  principle,  pass  through  every  one  of  the 
plotted  points  (and  more)  as  long  as  it  increases  monotonical ly  or  stays 
constant,  but  most  of  them  start  out  below  F(x)  and  tend  to  stay  below  F(x) 
at  the  beginning  of  the  plot  of  F  (x) • 

Fig.  23  shows  a  simplified  version  of  the  previous  figure.  The  points 
are  the  values  of  F(xp),  and  the  circled  points  have  three  numbers  plotted 
near  them.  The  upper  number  is  the  chance  times  1000  that  F^(x)  lies 
above  the  plotted  point;  the  middle  one  is  the  chance  times  1000  of  being 
at  the  plotted  point  and  the  lower  one  is  the  chance  times  1000  of  being 
below  the  plotted  point. 
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The  high  end  of  the  sample  cdf  is  upside  down  and  backward  relative  to 
the  low  end  as  shown  in  Fig.  24.  The  chance  that  it  will  be  above  the 
population  cdf  exceeds  the  chance  that  it  will  be  below  it.  The  last  value, 
which  by  definition  is  plotted  at  Fn(Xj  qqq)  =  1,  will  occur  before  or  at 
Xp  equal  to  about  3.2  for  367  out  of  1000  times.  The  numbers  across  the 
top  are  the  times  in  1000  that  Fn(x  )  reaches  the  value  1  at  the  plotted 
point.  The  numbers  by  some  of  the  points  are  the  times  in  1000  that  the 
sample  cdf  lies  above,  at,  or  below  the  population  cdf. 

For  a  collection  of  random  samples  known  to  be  from  the  normalized  pdf, 
more  than  half  of  the  sample  cdf's  will  be  below,  or  at,  the  population  cdf 
for  small  values  of  the  random  variable.  More  than  half  will  also  be  above, 
or  at,  the  population  cdf  for  high  values  of  the  random  variable.  Over  the 
entire  range  of  the  X^,  those  sample  cdf's  that  start  out  below  and  end  up 
above  the  population  cdf  would  be  most  frequent  (more  than  25%) .  Those 
that  start  out  above  and  end  up  below  would  be  least  frequent,  (less  than 
25%) .  Those  that  either  start  out  above  and  end  up  above  or  start  out  below 
and  end  up  below  would  be  about  equally  divided  among  the  remainder. 

The  sample  cdf  in  (108)  has  a  symmetric  binomial  distribution  about  an 
expected  value  of  0.5  only  for  k  =  n/2  which  is  the  median,  corresponding 
to  x  equals  1.1774  for  the  normalized  pdf.  The  mode  is  at  x  equals  1.  A 
sample  cdf  has  an  equal  chance  of  being  above  or  below  the  population  cdf 
near  Fn(x)  =  0.5. 

From  the  above  analysis,  a  random  sample  will  tend  to  be  deficient  in 
samp-le  values  at  both  the  low  end  of  the  range  of  definition  of  the  random 
variable  and  at  the  high  end  of  its  range  of  definition.  With  too  few 
sample  values  for  low  and  high  values  of  the  random  variable,  there  must 
be  too  many  in  some  intermediate  range  between  the  low  and  the  high  sample, 
values  and  the  sample  cdf  will  rise  more  rapidly  over  this  range.  The 
consequence  is  that  the  histogram  that  can  be  constructed  to  compare  to  the 
population  pdf  will  very  often  be  lower  than  the  values  computed  from  the 
population  pdf  at  the  outer  ranges  of  the  distribution  and  higher  in  the 
middle.  The  result  could  be  a  sample  pdf  that  resembled  the  dots  in  Fig. 

17  whereas  in  fact  the  sample  came  from  a  population  represented  by  the 
solid  curve. 
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It  appears  that  many  texts  that  describe  the  use  of  the  Chi  Square  test 

of  goodness -of -fit  recommend  pooling  a  number  of  class  intervals  at  the  low 

and  high  end  of  the  sample  distribution,  thus  giving  up  some  degrees  of 

2 

freedom,  before  calculating  x  •  If  this  is  not  done, the  usual  consequence 
is  a  number  of  large  values  in  the  sum  used  to  compute  Chi  Square  that  lead 
to  the  rejection  of  the  hypothesis  that  the  sample  came  from  the  hypothesised 
pdf. 


Non-Random  Samples.  The  samples  of  200  consecutive  waves  have  been  shown 
not  to  satisfy  the  definition  of  a  random  sample.  Each  sample  might  still 
have  come,  for  practical  purposes,  from  a  Rayleigh  pdf  in  the  sense  that 


f(Xl,  x2,...xn)(xr  X2*  x3 . »  Xn3 

practical  purposes,  something  like 


as  on  page  79  may  take  a  form,  for 


f ,  ..  (x. ,  x_,  x_, . . . . ,  x  ) 

(x  . ,  x  ,  x  ,)1  2  3’  ’  n 

p-1  p  p+1 

and  this  is  turn  may  have  a  marginal  pdf  that  is  Rayleigh.  Successive  equally 
spaced  points  in  a  time  history  are  strongly  correlated,  but  nevertheless 
this  pdf  can  be  defined  and  studied. 


The  difficulties  arise  in  testing  whether  or  not  the  sample  comes  from 
a  given  population  and  in  trying  to  study  the  extreme  values  of  the  sample. 
The  concept  of  an  effective  number  of  independent  points  given  a  sample  of  n 
values  is  thus  useful . 


Suppose  that  by  Monte  Carlo  methods  a  truly  independent  sample  of  100 
random  variables  is  drawn  from  the  normalized  Rayleigh  pdf.  Then,  for 
simplicity,  add  a  small  random  positive  or  negative  perturbation  to  each  of 
the  to  produce  a  non-random  sample  of  200.  Given  this  doctored  sample  of 
200  values,  a  statistician  unaware  of  how  it  was  produced  might  proceed  to 
analyse  it  as  if  it  were  a  random  sample  of  200  values.  The  sample  cdf  would 
look  a  bit  strange  because  it  would  jump  twice  for  values  of  Xi  that  were 
close  together  but  the  statistician  might  not  spot  this  peculiarity. 

From  the  previous  results,  the  chance  of  not  getting  a  value  of  X^^  less 

than  x  was  (0.995) 200  or  0.317  for  a  sample  of  200.  For  our  fudged 
,  0u5  inn 

sample,  it  is  (0.995)  or  0.606.  The  corresponding  values  for  the  next 
three  points  (200  versus  100)  are;  0.134  and  0.366,  0.048  and  0.221,  and 
0.017  and  0.133. 
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The  chance  of  not  seeing  a  high  wave  corresponding  to  an  exceedance 
probability  of  0.005  has  almost  doubled.  The  true  situation  may  not  be 
quite  that  bad,  but  it  is  easily  seen  that  the  lack  of  a  random  sample 
causes  difficulties  that  are  not  considered  in  most  of  the  papers  on  the 
subj  ect . 

Both  the  Chi  Square  test  of  goodness  of  fit  discussed  on  page  80  and 
the  distribution  of  the  sample  cdf  defined  by  (108),  (109)  and  (110)  are 
for  large  samples.  How  large  a  large  sample  ought  to  be  is  not  well  defined. 
For  a  p  of  0.995  and  a  random  sample  of  200,  the  distribution  of  F(199/200) 
is  skewed  and  the  sample  cdf  has  only  a  small  chance  of  having  the  appropriate 
value.  To  obtain  a  realistic  sample  cdf  might  require  a  much  larger  random 
sample,  say,  one  ten  times  greater.  Then  ^^iggo^x^  would  still  be  0.995 
and  VAR(F  (x))  would  be  one  tenth  of  the  value  for  a  sample  of  200. 
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FIGURE  23  Properties  of  the  Sample  CDF  for  Values  of  x  Up  to  a  Value  Corresponding  to  F(x  ) =0.245 
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FIGURE 


COMPUTATIONS  BASED  ON  THE  RAYLEIGH  DISTRIBUTION 


Three  models  have  been  suggested  as  possible  reasons  for  the  departure 
of  crest  to  trough  wave  heights  from  the  Rayleigh  pdf.  Two  have  to  do  with 
variation  of  the  envelope  between  a  crest  and  the  following  trough.  Given 
the  required  spectral  moments,  the  modifications  to  the  Rayleigh  pdf  would 
be  determined  by  the  theoretical  analyses  given  above.  The  third  requires 
an  assumption  about  the  physical  behavior  of  the  higher  waves  in  a  time 
history  as  they  break.  Each  of  these  three  models  introduces  additional 
degrees  of  freedom  for  fitting  a  theoretical  pdf  to  a  set  of  observations  of 
crest  to  trough  wave  heights.  With  more  freedom  as  to  possible  forms  for 
the  pdf,  it  should  be  expected  that  a  better  fit,  in  some  sense,  might  be 
obtained. 

Is  it  possible  that  the  Rayleigh  pdf  does  fit  the  significant  wave 
height  part  of  the  actual  crest  to  trough  wave  heights  if  the  pdf  is  based 
on  the  variance  given  by  the  area  under  the  spectrum?  The  data  presented 
so  far  shows  that  the  significant  wave  height  computed  from  the  crest  to 
trough  heights  is  nearly  always  less  than  the  value  computed  from  the  area 
under  the  spectrum. 

This  may  be  a  result  of  the  sampling  problem  that  has  been  described 
in  the  preceding  section.  As  shown  by  the  work  of  Spring  (1978),  even  the 
additional  freedom  to  pick  the  value  of  the  one  available  Rayleigh  parameter 
does  not  produce  agreement  between  the  theory  and  the  data  for  the  goodness- 
of-fit  test  that  was  used.  If  any  of  the  three  models  described  above  are 
important  the  actual  shape  of  the  pdf  would  change. 

The  shape  of  the  pdf  and  the  cdf  of  the  Rayleigh  distribution  is 
completely  determined  by  one  parameter.  Figure  25  shows  a  set  of  such  cdf's 
labled  with  the  appropriate  significant  wave  heights.  A  departure  of  the 
observations  from  the  cdf  might  imply  either  that  the  theory  is  at  fault  or 
that  the  lack  of  an  independent  random  sample  has  introduced  added  complexities 

A  variant  of  the  Rayleigh  distribution  given  by 


This  cdf  provides  two  parameters  instead  of  one  with  which  to  fit  the  data. 

The  pdf  is  given  by 

f(h)  =  abh^”*  e”a^  for  0  <  h  <  00  (115) 

and  zero  otherwise.  Fig.  26  illustrates  how  the  Rayleigh  cdf  can  be  modified 

* 

by  lowering  the  curve  for  h  <  h^  and  raising  it  for  h  >  h^ .  There  are  no 
physical  or  theoretical  reasons  for  such  an  assumed  form  for  this  family  of 
pdf's  other  than,  perhaps,  an  improved  empirical  fit.  The  model  actually 
mimics  the  kinds  of  sample  cdf's  that  can  result  from  a  typical  sample  of 
crest  to  trough  wave  heights. 

The  Rayleigh  pdf  is  a  good  starting  place  to  try  to  study  the  actual 
distributions  of  crest  to  trough  wave  heights.  If  consistent  systematic 
departures  from  the  Rayleigh  can  be  demonstrated  to  occur  for  wind  seas,  this 
could  be  the  first  step  toward  obtaining  an  improved  pdf. 

To  investigate  the  possibility  of  consistent  departures  from  the  Rayleigh 
pdf  so  as  to  provide  guidance  in  the  interpretation  of  results  to  be  obtained 
from  the  preceeding  theories  the  wave  height  data  have  been  processed  in  a 
number  of  different  ways.  Given  the  value  of  Eq  in  (1),  the  pdf  is  completely 
determined. 

The  average  wave  height  is,  of  course,  given  by. 

h  =  I  h  f(h)dh  =  2.5066  (E  )**  (116) 

mean  Jo  o 

The  value  of  a  such  that 

fCO 

f(h)  dh  =  \  (117) 

a. 

yields  the  significant  wave  height,  or  the  average  of  the  heights  of  the  one 
third  highest  waves . 

R"  I  =  3  I®  h  f (h)dh  =  4.0046  (E^*5  (118) 
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Or  vice  versa. 


Examples  of  the  Two  Parameter  Cumulative  Distribution  for  the  Values  Shown 


Other  quantities  all  based  on  the  Rayleigh  pdf  such  as  the  average  of 
the  heights  of  the  one  tenth  highest  waves,  and  the  expected  value  of  the 
highest  wave  in  200  can  be  found.  Also  that  highest  wave, such  that  for  5% 
(95%)  of  a  large  number  of  samples  of  size  N,  the  highest  wave  will  be  less 
(greater)  than  that  highest,  can  be  found  (Longuit-Higgins  (1952)). 

All  of  these  properties  of  the  pdf  are  represented  by  some  constant  times 

^  % 

Eq.  Each  therefore  provides  a  way  to  estimate  E^. 

If  the  sample  is  known  to  have  come  from  a  pdf  with  a  known  analytical 
form  but  with  an  unknown  parameter  (or  unknown  parameters)  then  the  way  to 
proceed  is  well  known.  However,  if  the  problem  is  to  find  out  how  the  data 
differ  in  some  subtle  ways  from  the  theoretical  pdf,  the  way  to  proceed  is 
not  as  clearly  defined.  If  the  additional  parameters  provide  a  family  of 
curves  that  vary  smoothy  and  equal  the  Rayleigh  for  zero  values  of  the  new 
parameters,  one  might  always  expect,  but  not  necessarily  actually  have,  a 
better  fit. 

The  data  from  Hurricane  Camille  are  obviously  not  from  a  stationary 
process.  Nor  are  the  successive  wave  heights  independent  (or  uncorrelated). 
They  key  requirement  for  the  analysis  of  time  series,  namely  stationarity  in 
the  time  series  sense,  is  missing.  Even  if  the  time  series  were  stationary 
in  the  time  domain,  successive  wave  heights  are  not  independent.  The  luxury 
of  an  independent  random  sample  is  not  available. 

Most  investigators  of  the  statistical  properties  of  wave  records  have 
not  considered  these  complications.  This  approach  has  not  been  followed  in 
this  study.  It  will  not  be  assumed  that  successive  samples  of  N  wave  heights 
are  from  a  stationary  process  and  that  within  this  sample  the  wave  heights  are 
independent . 

The  average  wave  height  can  be  estimated  in  many  different  ways,  implying 

h 

the  possibility  of  different  values  of  E^  and  The  first  is  from  simply 

averaging  all  the  wave  heights  and  using  equation  (116)  .  A  second  would  be 
to  use  the  variance  of  the  wave  record  computed  from  the  area  under  the 
estimated  spectrum.  The  data  in  Tables  1,  2  and  3  provide  a  way  to  compute 
Eq,  the  area  under  the  spectrum,  from  the  tabulated  values  of  RHS.  This  value 
can  then  be  used  to  compute  the  mean  wave  height  from  (116) . 
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A  third  way  to  estimate  the  average  wave  height  is  to  find  the  maximum 
likelihood  estimate  of  Eq.  For  an  independent  sample  of  size  N,  the 
probability  of  drawing  that  sample  is 


n  h. 


L(h^..,hn)  =  f(h1>h2, . .  .  ,h  )  =  tt  exp  (-h./8E  ) 

1  o 


(119) 


The  logarithm  of  this  likelihood  function  is 
n  h.  _ 

£n  L(h1,...,hn)  =  [  In  -i  -  In  Eq  -  hf/8EQ  (120) 


This  function  is  maximized  by  differentiating  it  with  reference  to  Eq 
and  setting  the  derivative  equal  to  zero. 


The  result  is  the  maximum  likelihood  estimate  of  E 

oMLE 


EoMLE 


(121) 


This  estimate  of  Eq  would  be  unbiased  if  the  sample  were  an  independent 
random  sample  since 

'W  ■(”•••  fk  1  hi<4T  eXP(-  1--/SE 0))dh1(...dhn  -  E  (122) 

'  o  J  o  1  o 

It  follows  that  the  best  estimate  of  h  may  not  be  the  average  of  the 
values  of  h,  but  the  value  computed  from 

h  =  2.5066  (E^g)*5  (123) 


The  fourth  and  last  way  that  the  mean  wave  height  can  be  estimated  is 
to  find  the  best  fitting  value  for  Eq  for  the  cdf  over  the  full  range  of 
heights.  The  use  of  a  limited  range  of  heights  as  in  Pierson  and  Salfi((1982) 
(unpublished))  is  misleading. 

The  sample  of  size  N  is  first  ordered  from  the  lowest  to  the  highest  wave 
such  that 


h,  <  h_  <••  <•• 


<  h.  .  <  h. 

J-l  3 


<■<■  I-n 


(124) 


The  sample  cdf  is  then  given,  as  defined  previously,  by 


The  difference  between  it  and  the  theoretical  cdf  is 


Dj  =  jf  "  d  -  exp(-  h^/8EQ)) 


(126) 


The  best  fit  value  of  E  is  that  value  that  will  minimize 

o 

n  - 

D  =  T  D  (127) 

1  3 


A  computer  program  has  been  developed  that  searches  for  that  value  of  Eq 
that  minimizes  D. 


The  use  of  the  significant  wave  height  became  popular  before  the  avail¬ 
ability  of  digital  wave  recording  methods  and  spectral  analysis.  Often  the 
one -third  highest  waves  were  determined  by  inspection  of  a  graph  of  the 
waves  on  chart  paper.  The  result  was  a  considerable  saving  of  time  in  data 
reduction. 


From  the  properties  of  the  cdf  given  previously,  in  a  random  sample  of 

200,  the  expected  number  of  waves  that  exceed  F(2/3)  is  200/3,  or  66.66",  or 

u  _ 

67.  The  standard  deviation  is  (400/9)  2,  or  (20/3),  or  6.66,  or  7.  The 
significant  height,  since  one  is  not  sure  that  one  third  of  the  waves  in  a 
random  sample  actually  comprise  all  values  belonging  to  the  set  of  the  one 
third  highest  waves  in  the  population,  need  not  be  the  average  of  just  the  one 
third  highest.  For  an  ordered  sample  this  provides  an  estimate  of  the  variability 
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of  this  quantity. 

A  similar  analysis  for  the  one  tenth 
expected  value  and  (200( .1) ( .9))^  =4  so 


(128) 


highest  waves  yields  20  for  the 
that 


200 

1 

1  200 

I  h 

177  1 

16  185  1 

(129) 

These  equations  do  not  consider  that  the  sample  may  not  be  random.  If 
the  effective  sample  size  were,  say,  100,  disguised  at  a  sample  of  200,  the 
numbers  to  use  would  be  67  ±  9  and  20  ±  6  so  as  to  broaden  the  area  of 
uncertainty. 


On  the  basis  of  the  discussion  of  the  sample  cdf,  it  will  usually  happen 
that  less  than  20  waves  from  a  sample  of  200  will  belong  to  the  one  tenth 
highest  and  that  less  than  67  will  belong  to  the  one  third  highest. 

Each  sample  of  200  waves  was  processed  in  the  same  way  so  as  to  produce 
a  table  similar  to  Table  8,  which  is  for  records  49  and  50.  All  55  tables 
are  given  in  the  appendix. 

The  top  part  of  Table  8  for  record  49  has  15  columns  and  11  rows.  The 
first  3  columns  are  the  averages  of  the  24  highest,  the  20  highest  and  the 
16  highest  waves  as  shown  along  the  diagonal  of  the  10  by  10  array.  Should 
4  more  heights  actually  belong  to  the  one  tenth  highest,  the  value  for 
the  average  of  the  l/10th  highest  would  be  37.0.  Should  exactly  20  heights 
belong  to  the  one  tenth  highest,  as  is  usually  assumed,  then  the  value  would 
be  37.8.  Finally  should  the  sample  only  contain  16  waves,  the  value  would 
be  39.3. 

The  next  three  rows  are  the  corresponding  estimates  of  the  one  third 
highest  waves  with  values  of  29.1,  29.9  and  30.8  on  the  diagonal.  The  value, 
19.5  is  the  average  of  the  200  waves  (HBAR) .  The  maximum  likelihood  esti¬ 
mate  (HMLE)  is  19.7  for  the  mean.  The  value  for  the  best  fit  cdf  (HCDF)  is 
19.7  for  the  mean,  and  the  mean  value  based  on  the  area  under  the  spectrum 
is  20.7  (HSPT) . 

The  values  for  HMLE,  HCDF  and  HSPT  were  computed  from  estimates  of  (Eq)  2, 
which  is  given  in  the  column,  Eq.  The  column  labled  Eq  is  the  square  root  of 
Eq,  which  was  computed  to  more  significant  figures  before  rounding  and 
tabulat ion . 


Each  of  the  first  ten  columns  in  Table  8  represents  a  statistic  that 
can  be  obtained  from  either  the  data  in  the  form  of  h^,  h^, . . . .h2Q0  or  in 
the  form  of  the  estimated  spectrum.  From  each  statistic  a  statistic  for 
Eq  ,  (and  E  )  can  be  obtained  as  some  constant  times  the  value,  or  the  square 

of  the  value, that  was  obtained.  From  each  of  the  possibly  different  values 

% 

of  E^  ,  the  average  of  the  one  tenth  highest,  one  third  highest  and  of  all 

the  sample  can  be  obtained.  The  columns  thus  show  ten  different  ways  to 

obtain  the  quantity  headed  by  the  column  when  appropriate  nearly  quantities 
are  shifted  to  an  appropriate  column.  As  printed  H10P  can  be  shifted  right, 

H10M.  left,  HSIP  right,  HSIM  left  and  HMLE,  HCDF  and  HSPT  left  so  as  to 

form  three  columns  under  H10H,  HSIG  and  HBAR. 

Given  a  very  large  random  sample,  say  two  million,  from  a  Rayleigh  pdf, 
it  would  be  anticipated  that  all  of  the  values  in  each  column  would  agree  to 
about  ±  0.4%.  For  a  non-random  sample  of  200  waves,  the  values  as  computed 
by  these  different  methods  for  H10H  for  record  49  are  all,  but  one,  within 
the  range  of  uncertainty  for  that  value  of  37.0  to  39.3  ft.  The  exception 
is  the  value  obtained  from  the  spectral  area  which  is  41.0  feet. 

There  are  quite  small  differences  between  corresponding  values  in  a 
given  column  and  the  values  obtained  from  any  one  of  the  three  values,  HBAR, 
HMLE  and  HCDF,  always  with  the  exception  of  those  obtained  from  the  area 
under  the  spectrum.  Both  records  49  and  50  have  these  features.  As  a 
general  statement,  with  perhaps  an  exception  once  in  a  while,  the  55  data 
arrays  have  this  feature. 

•  The  value  across  from  H200  is  the  value  of  the  highest  wave  in  the 

sample.  The  values  in  the  columns  H5  is  the  height  such  that  for  5%  of 

4 

the  random  samples  of  200  waves  with  an  (E  )  value  from  the  main  diagonal 
(or  as  tabulated),  the  highest  wave  in  the  sample  would  be  less  than  the 
tabulated  value.  Those  in  HMP  and  H200  are  respectively  the  most  probable 
and  the  expected  values  for  the  highest  wave  in  200.  The  value  H95  is  the 
value  to  be  exceeded  only  5%  of  the  time.  For  record  49,  the  sample  value 
of  46.5  lies  between  H5  and  HMP  for  all  values  except  the  two  from  the 
spectrum.  For  record  50,  the  sample  value  of  65.4  is  greater  than  H95  for 
all  values  except  H10M,  HSIM  and  HSPT. 


TABLE  8  Statistics  Obtained  from  Samples  49  and  50  as  Defined 
in  the  Text. 
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If  Eq  were  really  known,  and  if  all  the  other  ifs  involved  were  correct, 

the  heights  in  a  particular  sample  could  be  normalized  by  Eq.  (111).  Double 

the  normalized  value  would  then  correspond  to  the  corresponding  heights  for 

a  spectrum  with  a  value  of  Eq  equal  to  one.  Since  only  various  estimates  of 

the  value  of  E  are  available,  the  statistics  have  been  normalized  in  three 
o 

different  ways  by  using  estimates  based  on  HCDF,  HBAR  and  HMLE  so  as  to 
obtain  normalized  values  of  the  quantities  across  the  top  in  the  eight  by 
three  array  below  the  main  array.  Typically,  the  normalized  values  differ, 
for  the  two  samples  shown,  in  the  second  decimal  place. 

An  examination  of  the  tables  for  all  55  samples,  as  given  in  the  appendix, 
raises  a  large  number  of  interesting  questions.  One  might  be:  Despite  the 
indications  in  the  data,  is  it  possible  that  a  combination  of  reasons  could 
support  the  idea  that  heights  computed  from  the  area  under  the  spectrum  are 
the  best  values  to  use?  Another  could  be:  Have  actual  departures  of  samples 
of  200  consecutive  waves  been  shown, at  some  confidence  level,  to  differ  from 
the  Rayleigh  pdf?  An  attempt  to  provide  properly  qualified  answers  to  these 
questions,  and  others,  will  be  made. 

Figures  27,  28  and  29  are  plots  of  the  normalized  values  given  on  the 
bottom  three  rows  of  each  of  the  tables  in  the  appendix  with  the  normalizations 
based,  in  order,  on  the  best  fit  cdf,  the  mean,  and  the  maximum  likelihood 
estimate.  For  each  record  number  from  the  bottom  to  the  top,  the  value  are 
the  normalized  values  of  HBAR,  HSIP,  HSIG,  HSIM,  H10P,  H10H,  H10M  and  H200. 

All  three  figures,  at  first  glance,  are  remarkably  similar  except  that  the 
mean  normalized  by  the  mean  is  a  constant.  (The  appropriate  constant  was 
rounded  to  1.253,  and  calculations  faithfully  discovered  that  the  standard 
deviation  was  0.0003). 

In  general,  the  figures  show  that  the  smaller  the  sample;  i.e.  200,  67±7, 
20±4,  and  1,  the  larger  the  variability  of  the  estimate  about  its  expected 
value  as  shown  by  the  arrow  on  the  right  hand  side.  The  horizontal  bars  are 
analogous  to  confidence  intervals,  but, being  based  on  the  incorrect  assumption 
of  a  random  sample  of  200  values,  they  are  not  broad  enough  to  enclose  the 
expected  values  often  enough. 

Table  9  shows  the  distribution  of  the  normalized  mean  for  sample  intervals 
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TABLE  9 


Counts  per  Class  Interval  for  the  Normalized 
Mean.  Theoretical  Value  Equals  1.2533. 


TABLE  10  Counts  per  Class  Interval  for  Various  Estimates  of  the 
Significant  Wave  Height. Theoretical  Value  Equals  2.0023. 
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TABLE  11  Counts  per  Class  Interval  for  Various  Estimates  of  the  Average 
of  the  One  Tenth  Highest. Theoretical  Value  Equals  2.5504. 


CDF 


NORMALIZED  BY 

MEAN  MLE 


k.  • 


I  NT 
2.05 
2.10 
2.15 
2.20 
2.25 
2.30 
2.35 
2.40 
2.45 
2.50 
2.55 
2.60 
2.65 
2.70 
2.75 
2.80 
2.85 


HI OP  HI OH  H10M 


2 

5  3 

8  5  2 

3  7  5 

8  5  6 

9  9  3 

10  8  7 

5  8  6 

2  5  9 

2  2  5 

1  2  4 

3 

1  4 

1 


HI OP  HI OH  H10M 


4  1 

7  4 

5  7  3 

6  5  8 

8  7  3 

16  9  5 

5  12  7 

3  4  10 

14  7 

1  4 

1  6 

1 
1 


H10P  HI OH  H10M 


1 

7  3 

11  10  2 

9  7  6 

16  9  6 

8  14  6 

2  9  12 

1  2  10 

1  7 

4 
2 
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i  1  Afc*  *>  4^14^ 


of  0.02  with  the  lower  end  of  the  class  interval  tabulated  and  the  upper  end 
0.01999  greater.  The  CDF  normalizes  low  and  the  MLE  normalizes  high  compared 
to  the  expected  value  of  1.2533. 

Table  10  shows  the  distribution  of  the  various  estimates  of  the  normal¬ 
ized  significant  height.  It  is  notable  that  the  estimates  normalized  by  the 
MLE  and  HSIM  come  the  closest  to  scattering  uniformly  about  the  expected 
value  of  2.0023.  The  maximum  likelihood  estimate  of  (Eq) 2  is  generally 
smaller  than  the  one  based  on  the  mean.  It  is  smaller  because  high  values 
of  the  Ik  are  usually  missing  from  the  sample  as  described  in  the  section  on 
sample  cdf's.  Normalization  then  makes  the  other  estimates  larger.  When 
combined  with  a  too  small  correction  for  the  missing  high  values,  the  net 
effect  is  to  obtain  a  nearly  correct  value. 

Table  11  shows  the  distribution  of  the  various  estimates  of  the  average 
of  the  one  tenth  highest.  The  class  intervals  are  0.05  instead  of  0.02. 

The  spread  is  much  greater,  but  again  the  value  of  H10M  comes  the  closest  to 
scattering  the  least  about  the  value  of  2.5504. 

Table  12  shows  the  distribution  of  the  normalized  value  of  the  highest 
wave  in  each  sample.  Two  class  intervals  are  tabulated  under  each  interval 
heading  to  save  space.  For  example  for  CDF  at  2.8,  there  was  one  value 
between  2.8  and  2.84999  and  these  were  four  values  between  2.85  and  2.8999. 
The  very  large  scatter  of  the  normalized  values  is  better  judged  from  Figs. 
27,  28  and  29.  Most  of  the  data  are  below  the  value  of  3.44. 

Table  13  shows  the  means  minus  and  plus  one  standard  deviation  for  each 
of  the  eight  quantities  for  the  normalized  values  from  the  55  samples. 
Underlined  pairs  show  that  the  theoretical  expected  value  lies  within  the 
range  shown.  The  results  are  exactly  what  one  would  anticipate  from  the 
preceding  analyses  and  are  more  easily  interpreted  in  terms  of  Table  14 
which  expresses  all  of  the  values  in  terms  of  percentages  of  the  theoretical 
values.  For  H,  computed  two  ways  other  than  from  H  itself,  one  methods  is 
0.7%  too  low  but  is  within  plus  one  standard  deviation.  The  other  is  1% 
too  high  hut  is  within  minus  one  standard  deviation. 

The  normalized  significant  height  averages  to  2.9%,  2.2%  and  1.2%  less 
than  the  expected  value,  but  the  range  of  the  average  plus  one  standard 


deviation  encloses  the  theoretical  value  for  all  three  normalizations. 
Averaging  the  sixty  highest  produces  an  average  within  0.4%,  0.3%  and  1.3% 
of  the  expected  value.  The  theoretical  value  is  on  the  high  side  when 
normalized  by  the  CDF  and  on  the  low  side  when  normalized  by  both  the  mean, 
and  the  MLE,  but  still  within  one  standard  deviation. 

The  normalized  average  of  the  one  tenth  highest  waves  is  low  and  adding 
one  standard  deviation  does  not  enclose  the  theoretical  value.  The  average 
of  the  sixteen  highest  yields  average  normalized  values  that  are  low  by  2.7% 
2.0%  and  1.1%  and  lie  within  plus  one  standard  deviations  of  the  expected 
value . 

The  average  of  the  normalized  values  of  the  highest  wave  in  each  sample 
is  too  low  by  8.8%,  7.7%  and  6. 9%, but  plus  one  standard  deviation  encloses  the 
expected  value.  The  distribution  of  these  values  has  many  strange  features, 
to  be  discussed  later. 


TABLE  14  NORMALIZED  WAVE  STATISTICS  AS  PERCENT  OF  THEORY  FROM  TABLE  13. 


DOES  Eq  FROM  THE  SPECTRUM  PREDICT  THE  SIGNIFICANT  HEIGHT? 


Figures  1  and  2  and  Tables  1,  2,  3,  and  8,  along  with  the  other  54  tables 
in  the  appendix,  plus  the  theories  appropriate  to  Figures  17  and  18,  plus 
numerous  references,  plus  the  lack  of  a  random  sample  and  the  peculiarities 
of  the  sample  cdf,  all  seem  to  suggest  that  the  significant  wave  height  is  not 
correctly  given  by  either  eqn.  (3)  (or  eqn.(118))  if  the  area  under  the  wave 
spectrum  is  used  to  find  E  . 

The  theories  that  support  this  concept  are  all  variations  of  a  linear 
underlying  model  and  cannot  describe  the  effects  of  higher  order  nonlinearities 
that  could  increase  the  heights  of  the  higher  waves  in  a  time  history  so  that 
the  effects  treated  in  the  present  theories  would  be  cancelled  out,  either  in 
part,  completely  or  overcompensated  for.  Most  wave  recording  methods  at 
present  use  a  digitized  sample,  obtained  every  At  seconds,  with  2n  values, 
find  the  estimate  of  the  spectrum,  truncate  it  at  some  upper  frequency,  find 
the  area  under  the  truncated  spectrum  and  compute  the  significant  wave  height. 

This  procedure,  apart  from  problems  in  the  frequency  response  of  the  wave 
recorder,  can  introduce  an  additional  contribution  to  the  variance  of  the  time 
history  because  of  digitization  noise  that  may  or  may  not  be  white. 

If  for  exanple. 


n(o),  n(At),  n(2At) . n((n-i)At) 

is  the  record  that  ought  to  have  been  digitized,  and  if  each  value  has  an 
error  such  that 

.  n(o)  +  e(o),  n(At)  +  e(At),  n(2At)  +  e(2At), . 

n((n  -  1) At)  +  e((n  -  l)At), 


then 


VAR(n(t)  +  e(t))  is  estimated  by 

1  n_1  7 

VAR  =  -  l  (n(pAt)  +  e(pAt) ) 

1  o 

1  f  2  2 

-  -  l  ^Cn(pAt))  +  2n(pAt)  e(pAt)  +  U(pAt)) 


(130) 


The  second  term  in  eqn.  (130)  may  average  to  zero,  but  the  last  one  cannot. 
The  errors  may  even  have  a  spectrum  that  is  not  simply  more  or  less  a  constant 
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at  all  frequencies  below  (2At) The  variance  found  from  a  particular 
time  history  can  then  be  too  high,  and  when  the  significant  height  is 
computed  from  this  variance,  it  is  too  high,  for  one  more  reason  other  than 
all  of  the  others  found  so  far,  compared  to  the  average  of  the  one  third 
highest  waves  in  the  same  sample. 

Given  only  a  sample  of  crest  to  trough  wave  heights,  plus  the  fact  that 
successive  values  are  not  independent,  plus  the  particul iarities  of  sample 
cdf's  from  a  Rayleigh-like  population,  there  may  be  no  optimum  way  either  to 
obtain  the  best  fit  value  for  the  single  Rayleigh  parameter  or  to  find  out 
which  of  many  possibilities  may  modify  the  Rayleigh  pdf  to  some  other  pdf 
with  additional  parameters  that  result  in  forms  that  have  a  limiting  shape 
of  the  Rayleigh. 

The  best  fit  cdf,  the  mean,  and  the  maximum  likelihood  estimate  for  E 

o 

(or  Eq 2)  give  slightly  different  results  that  are  all  below  the  value  from 
the  spectra  of  the  waves  from  Camille.  The  lack  of  high  waves  in  the  sample 
of  limited  effective  size  not  only  affects  the  direct  estimates  of  the 
significant  wave  height  and  the  average  of  the  one  tenth  highest  but  also 
affects  the  estimates  that  involve  the  entire  sample. 

Figure  30  shows  schematically  the  kind  of  thing  that  can  happen  for  a 

sample  from  a  Rayleigh  pdf.  The  solid  curve  is  the  normalized  pdf  and  the 

dashes  represent  a  smoothed  version  of  a  possible  sample.  The  estimate  of 

the  mean  is  shown  schematically  in  Fig.  31.  In  Fig.  30  two  thirds  of  the  area 

\. 

under  the  theoretical  pdf  lies  to  the  left  of  (2  £r,  3)  2,  seven  thirtieths 

u  u  v 

lies  between  (2  £r|  3)  2  and  (2  £r|  10)  2  and  one  tenth  lies  above  (2  inlO)  2. 

*  1^ 

The  contribution  to  the  mean  for  values  less  than  (2  £r)  3)  2  can  be  too  large 

compared  to  what  ought  to  have  been  found,  and  the  contribution  for  values 

greater  than  (2  £ri  10)  2  could  be  too  small  as  indicated  in  Fig.  31. 

Consider  the  values  that  contribute  to  the  normalized  mean  as  in 
equation  (131), 
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which  upon  integration  yields. 


— - r-  =  0.58586  +  0.41240  +  0.25504  (132) 

2(Eq)^ 

If  the  sample  is  such  that  it  does  not  contain  enough  high  waves,  then 
there  must  be  an  excess  of  lower  waves.  For  an  actual  sample,  the  contribution 
to  the  mean  from  the  lowest  2  n/3  waves  and  from  the  highest  n/10  waves  ought 
to  have  a  ratio  of  (0.58586)/ (0.25504)  =  2.29713.  The  appropriate  sums  can  be 

133  180  200 

h  =  200  ![  hi  200  ^34  hi  +  200"  ^gl  hi  (133) 

found  with  some  ingenuity  from  the  values  in  Table  8,  and  the  corresponding 
tables  for  the  total  of  55  samples  in  the  appendix.  The  column  for  H10H  provides 
additional  significant  figures  for  the  calculation. 

Table  15  shows  the  results  for  the  55  samples  tabulated  according  to  sample 

number.  The  first  value  is  the  value  from  the  spectrum.  The  second  is  the  value 

from  the  highest  20  waves.  The  third,  R^,  is  the  value  of  the  ratio  of  the 

first  term  in  (133)  to  the  value  of  the  last  term  in  (133).  For  most  samples, 

this  ratio  exceeds  the  value  of  2.29713.  The  ratio  of  the  sample  ratio  to 

the  theoretical  ratio  is  shown  in  the  column,  R  .  This  ratio  times  the  value 

n 

estimated  from  the  one  tenth  highest  waves  yields  a  corrected  value  for  the 
Rayleigh  parameter.  The  difference  before  correction  is  shown  by  and  after 
correction  by  D^.  If  H10M  is  used  instead  of  H10H  and  the  calculation  repeated 
(with  the  value  treated  as  if  it  were  the  average  of  the  20  highest  waves)  the 
final  difference  is  given  by  D_. 


TABLE  15 


Possible  Effect  of  Sample  Size  on  the  Estimation  of 
Rayleigh  Parameter 


(EOSP)55 

(E10)^ 

R1 

1 

0.902 

0.765 

2.636 

2 

0.902 

0.725 

2.869 

3 

0.902 

0.745 

2.705 

4 

1.039 

0.863 

2.865 

5 

1.078 

0.852 

2.504 

6 

1.118 

0.902 

2.812 

7 

1.098 

0.882 

2.874 

8 

1.235 

1.019 

2.742 

9 

1.235 

1.078 

2.593 

10 

1.333 

1.196 

2  .391 

11 

1.666 

1.372 

2.748 

12 

1.686 

1 .353 

2.595 

13 

1.725 

1.510 

2.455 

14 

1.843 

1.627 

2.355 

15 

2.059 

1.588 

2.702 

16 

1.961 

1.725 

2.411 

17 

2.059 

1.745 

2  .458 

18 

2.117 

1.902 

2.221 

19 

2.333 

2  .000 

2.209 

20 

2.411 

2.117 

2.795 

21 

2.509 

2.294 

2.409 

22 

2.706 

2.490 

2.152 

23 

2.784 

2.451 

2.253 

24 

2.686 

2.313 

2.239 

25 

2.666 

2.215 

2.465 

26 

2.745 

2.333 

2.479 

27 

2.823 

2.490 

2.690 

28 

2.980 

2.627 

2.424 

29 

3.058 

2.588 

2.663 

30 

2  .960 

2.509 

2.513 

31 

3.176 

2.725 

2.480 

32 

3.215 

2.804 

2.411 

33 

3.333 

3.039 

2.396 

34 

3  .-196 

2.862 

2.384 

35 

3.411 

2.980 

2.351 

36 

3.548 

3.137 

2.193 

37 

3.686 

3.529 

2.191 

38 

3.921 

3.627 

2.095 

39 

4.097 

3.607 

2.432 

40 

4.117 

3.646 

2.370 

41 

4.391 

3.745 

2.604 

42 

4.803 

4.215 

2.622 

43 

4  .940 

4  .490 

2.374 

44 

5.740 

5.038 

2.399 

45 

5.842 

5.313 

2.447 

46 

6.509 

5  .979 

2.391 

47 

7.430 

6.724 

2.367 

48 

7.999 

7.528 

2.314 

49 

8.038 

7.411 

2.324 

50 

8.234 

7.822 

2.251 

R 

n 

(E10C)15 

D1 

°2 

1.148 

0.877 

0.14 

0.02 

1.249 

0.906 

0.17 

-  0.00 

1.178 

0.877 

0.15 

0.02 

1.247 

1.076 

0.18 

-0.04 

1.090 

0.929 

0.23 

0.15 

1.224 

1.104 

0.22 

0.02 

1.251 

1.104 

0.22 

-0.01 

1.194 

1.217 

0.22 

0.02 

1.129 

1 .217 

0.16 

0.02 

1.041 

1 .245 

0.13 

0.09 

1.196 

1.642 

0.30 

0.03 

1.130 

1.528 

0.34 

0.16 

1.069 

1.613 

0.21 

0.12 

1.025 

1.669 

0.21 

0.17 

1.176 

1.868 

0.47 

0.17 

1.050 

1.811 

0.23 

0.15 

1.070 

1.867 

0.32 

0.19 

0.967 

1.838 

0.22 

0.28 

0.962 

1.923 

0.33 

0.41 

1.217 

2.576 

0.29 

-0.15 

1.049 

2.405 

0.22 

0.10 

0.937 

2.333 

0.22 

0.38 

0.981 

2.403 

0.33 

0.38 

0.975 

2.254 

0.38 

0.43 

1.073 

2.377 

0.45 

0.29 

1.079 

2.518 

0.41 

0.22 

1.171 

2.915 

0.33 

-0.10 

1.055 

2.772 

0.35 

0.21 

1.159 

3.000 

0.47 

0.06 

1.094 

2.745 

0.45 

0.21 

1.080 

2.942 

0.45 

0.24 

1.049 

2.942 

0.42 

0.28 

1.043 

3.169 

0.29 

0.16 

1.038 

2.970 

0.30 

0.23 

1.025 

3.055 

0.43 

0.35 

0.955 

2.994 

0.41 

0.56 

0.954 

3.365 

0.16 

0.31 

0.912 

3.308 

0.29 

0.61 

1 .059 

3.820 

0.49 

0.26 

1.032 

3.762 

0.47 

0.36 

1.133 

4.244 

0.65 

0.15 

1.141 

4.811 

0.58 

-0.01 

1.033 

4.639 

0.45 

0.30 

1.044 

5.262 

0.70 

0.48 

1.065 

5  .658 

0.53 

0.18 

1.041 

6.223 

0.53 

0.29 

1.031 

6.930 

0.69 

0.50 

1.007 

7.582 

0.47 

0.42 

1.012 

7.497 

0.62 

0.54 

0.980 

7.665 

0.41 

0.56 

Continued  on  page  166. 
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0.00 

-0.05 

0.00 

-0.06 

0.10 

-0.01 

-0.05 

-0.02 

-0.02 

0.08 

-0.02 

0.12 

0.02 

0.11 

0.15 

0.09 

0.13 

0.18 

0.33 

-0.24 

0.00 

0.30 

0.30 

0.36 

0.21 

0.12 

-0.23 

0.11 

-0.01 

0.11 

0.15 

0.16 

0.00 

0.11 

0.23 

0.44 

0.17 

0.45 

-0.13 

-0.22 

-0.03 

-0.12 

-0.12 

-0.25 

0.03 

-0.06 

-0.30 

0.04 

-0.21 

-0.18 
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The  value  of  R^  is  greater  than  one  for  45  out  of  55  samples.  The 
corrected  value  of  (ElOC)^  exceeds  the  value  from  the  spectrum  eight  times. 
However,  the  averages  of  the  ratios  of  (E10C)  2  to  (EOSP)  2  is  only  0.94  so 
that  heights  found  from  the  spectrum  are  still  high  compared  to  heights  from 
the  crest  to  trough  height  samples.  The  values  for  show  that  corrections 
based  on  H10M  may  overcompensate  for  the  effects  considered  since  the  corrected 
value  exceeds  or  essentially  equals  the  spectral  value  27  times. 

The  one  standard  deviation  bounds  on  HSIG+,  HSIG,  H10+,  H10  and  H10-  in 
Table  14  all  enclose  94%  so  that  it  may  be  asking  too  much  to  have  a  partic¬ 
ular  sample  of  significant  wave  heights  verify  within  plus  or  minus  10%  when 
the  spectral  variance  is  used.  There  is  no  convenient  way  to  study  the 
hypothesis  that  digitization  errors  account  for  the  differences  that  are 
observed  with  the  presently  available  data.  The  complete  wave  recording 
system  would  have  to  be  analysed  step  by  step. 

It  is  difficult  enough  to  record  waves  72  feet,  65  feet,  69  feet  and  70 
feet  from  crest  to  trough  without  quibbling  about  a  few  feet  of  possible  error 
in  the  measurements.  Most  other  wave  measuring  system  have  far  greater 
difficulties  than  the  system  used  to  measure  these  waves  in  Camille. 

The  choice  that  needs  to  be  made  is  whether  or  not  to  use  the  significant 
height  based  either  on  HBAR  or  HMLE  or  on  the  spectral  moment.  The  largest 
difference  is  2.7  feet  out  of  35  feet  (from  the  spectrum)  for  record  52  so  that 
the  significant  height  from  the  spectrum  is  8%  higher  compared  to  that  from 
the  MLE.  It  seems  the  better  part  of  wisdom  until  more  is  known  about  the 
vagaries  of  a  particular  wave  recording  instrument  system,  to  use  the  spectral 
moment  with  the  reservation  that  the  result  may  be  slightly  too  high  for  a 
particular  sample. 
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EXTREME  WAVE  HEIGHTS 


If  F(x)  is  the  cdf  appropriate  to  a  random  sample  of  size  N,  the  cdf 
of  the  largest  value  is  given  by  equation  (134) 

Fe(x)  =  (1  -  F (x) )N  (134) 

The  pdf  of  the  highest  value  in  the  sample  is  given  by 

fE(x)  =  N(1  -  F(x))N1  d  F(x)/dx  (135) 


For  the  normalized  Rayleigh  distribution,  these  equations  become 
2 

F£(x)  =  (1  -  e"X  /2)N  (136) 


and 


*  r  i  -x2/2, N-l  -x2/2 

f  (x)  =  N(1  -  e  )  xe 


(137) 


Figures  32,  33  and  34  show  the  normalized  sample  cdf's  based  on  the 
best  fit  cdf,  the  mean,  and  the  maximum  likelihood  estimate.  Also  shown  are 
the  graphs  of  equation  (136)  for  N  equal  to  200,  100  and  50.  The  three 

theoretical  curves  nearly  parallel  each  other  and  the  distances  at  p  =  0.5 
separating  the  curves  for  50  and  100  and  for  100  and  200  are  almost  the  same. 
This  is,  of  course,  a  feature  of  the  extreme  value  theory  of  Longuit -Higgins 
(1952)  where  the  various  properties  of  the  extreme  values  vary  as  the 
logarithm  of  the  sample  size.  Doubling  the  sample  size  to  200  from  100  only 
increases  the  expected  value  of  the  highest  wave  in  the  sample  by  about  15%. 

The  sample  cdf's,  based  on  55  normalized  extreme  values,  have  little, 
if  any,  relation  to  the  theory  of  extreme  values  in  a  random  sample  of  200 
waves.  Even  the  concept  of  an  equivalent  sample  size,  so  useful  in  the 
interpretation  of  spectral  estimates,  leads  to  difficulties  since  this 
quantity  is  not  necessarily  constant  from  one  sample  to  another.  The  three 
sample  cdf's  shift  toward  higher  values  depending  on  which  of  the  three  ways 
the  normalizing  parameter  is  chosen  with  normalization  by  the  maximum  like¬ 
lihood  estimate  being  the  farthest  to  the  right.  This  shows  that  overall, 
this  parameter  is  smaller  than  the  one  found  from  the  mean.  The  .higher  waves 
some  of  which  are  missing  from  a  particular  sample,  cause  this  parameter  to 
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FIG.  34  Sample  Extreme  Value  cdf  and  Theoretical  Cu 
of  each  Sample. 
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perhaps  be  underestimated  because  they  have  a  greater  effect  on  the  estimate. 

The  relative  concentration  of  the  normalized  values  relative  to  the  MLE  as  in 
Tables  10  and  11  is  also  explained  by  this  effect  since  this  choice  of  para¬ 
meter  emphasises  the  higher  waves  in  a  sample. 

Both  Figures  33  and  34  suggest  that  the  effective  sample  size  may  be  some¬ 
where  between  80  and  100  for  p  up  to  49.  However,  the  six  highest  values  for 
normalized  H200  for  records  14,  18,  33,  35,  48  and  50  switch  from  values  near 
the  theoretical  curve  for  100  to  the  theoretical  curve  for  200.  The  11% 
highest  waves  in  repeated  samples  of  200,  that  is  the  extreme  of  the  extremes, 
are  predicted  quite  well  by  the  theory.  Of  these  six  extremes,  the  first 
three  are  close  to  the  value  for  H95.  For  record  35  only  the  theoretical 
value  from  the  spectral  variance  exceeds  the  observed  height.  For  record  48, 
all  theoretical  values  are  less  than  the  observed  height.  For  record  50,  all 
but  three  are  less  than  the  observed  height. 

For  record  48,  if  the  spectral  parameter  is  used,  the  chance  of  exceeding 
a  height  of  72.2  feet  is  0.00784  for  a  random  sample  of  200  waves.  For  55 
random  samples  of  200  waves  the  chance  of  a  wave  this  large  or  larger  is  one 
or  more  of  the  samples  is  0.351  so  that  this  wave  is  not  unusual  if  the  extreme 
value  theory  is  applied  to  a  random  sample  of  200.  If  the  sample  size  were 
only  100,  the  chance  of  finding  a  wave  this  high  or  higher  in  55  sample  would 
be  0.195,  which  is  not  too  improbable.* 

Based  on  the  peculiar  properties  of  sample  cdf's  for  the  individual  samples, 
and  on  the  lack  of  a  random  sample,  there  is  no  convincing  evidence  that  the 
higher  waves  in  a  Rayleigh  pdf  are  eliminated  by  breaking  and  that  the  pdf  is 
modified  in  some  way  to  account  for  breaking.  The  heights  that  can  be  attained 
by  irregular  trains  of  waves  of  different  heights  are  not  well  understood,  and 
the  upper  limit  on  how  high  they  can  get  may  not  be  discernable  in  typical 
samples. 

* 

For  a  moment  based  on  HCDF,  which  is  probably  too  low;  the  corresponding 
values  are  0.00145,  0.08  and  0.04,  a  modestly  rare  occurrence. 


A  REVISIT  OF  THE  RESULTS  OF  SPRING 


Figures  1,  2  and  3  of  this  present  study  and  Tables  1,  2  and  3  on  pages 
10  to  20  showed  only  the  properties  of  the  initial  stages  of  the  analysis  of 
this  same  data  by  Spring  (1978).  It  is  now  possible  in  view  of  the  various 
points  raised  in  the  preceding  sections  to  return  to  his  study,  reinterpret 
the  results  that  were  obtained  and  suggest  possible  alternative  conclusions 
that  might  be  drawn  from  his  analysis. 

In  describing  how  tests  were  made  to  see  whether  or  not  the  samples  of 

crest  to  trough  wave  heights  would  fit  the  Rayleigh  pdf,  the  following 

material  is  quoted  directly  from  Spring  with  changes  in  equation  numbers 

★ 

and  figure  numbers. 

"Borgman  (1973)  describes  a  method,  attributed  to  D.  Lee  Harris  of  the 
Coastal  Engineering  Research  Center,  to  test  the  appropriateness  of  a 
distribution  function  to  ranked  data.  The  method  has  been  used  in  this 
study  to  test  the  Rayleigh,  Forristall  and  a  mean  height  distribution  and 
is  described  below. 


"According  to  Hogg  and  Craig  (1970,  page  13),  an  estimate  of  the 
distribution  function  from  ranked  data  may  be  obtained  from 


P[h 


N-r+1 

N+l 


(138) 


where  h^  is  the  height  of  the  rth  largest  observed  wave,  N  is  the  total 
sample  size  and  r  is  the  wave  rank  after  the  observed  waves  are  sorted  into 
descending  heights.  For  the  Rayleigh  distribution 

2  2 

P[h(r)l  =  l-e'h  /a  (139) 

where  =  8mQ  and  mQ  is  the  total  variance  of  a  record.  Combining  equations 
f 1 38)  and  (139)  and  solving  for  r,  one  obtains 

ln  r  =  -  (o'  m0l/")  h(r)  +  ln  (N  +  !)•  (140) 


Thus,  a  plot  of  ln  r  versus  the  wave  height  squared  should  yield  a 
straight  line  fitted  by  the  parameters,  mQ  and  N,  if  the  tested  wave  heights 
are  obtained  from  a  sample  which  is  Rayleigh  distributed. 

*A  typographical  error  in  (138)  has  been  corrected  by  charging  -1  to  +1  in 
the  numerator. 


"Confidence  limits,  or  probability  intervals,  can  be  computed  as 
described  by  Borgman  (1973).  These  confidence  limits  can  be  employed  to 
observe  whether  the  oscillations  of  the  data  around  the  theoretical  relation¬ 
ship  are  causes  to  accept  or  reject  the  theoretical  relationship.  According 
to  Fisz  (1963,  pg.  375,  equation  10.3.7),  the  distribution  function  for  the 
rth  largest  observation  taken  from  a  Rayleigh  population  would  be 


r-1  M  ,2,  2  k  ,2.2  N-k 
PH(r)  lh)  =  £=„  <  k><e'  >  > 


(141) 


"To  obtain  the  upper  and  lower  95%  confidence  interval,  equation  141  is 
solved  for  (h)  equal  to  0.975  and  0.025  respectively  by  the  method 

outlined  in  the  NBS  Handbook  of  Mathematical  Functions  (1964,  pg.  960,  examples 
18  and  20).  The  method  is  briefly  outlined  below  for  the  upper  confidence 
interval . 

Example  18  from  the  above  reference  shows 


r  1  ,N  ,  k  N-k  .  r  ,N  ,  k  N-k 

L  (  JP  q  =1-1  (  k)P  q 

k=0  k=r 


(142) 


-  .  -h2/^ 


where  p  =  e  and  q  =  1-p.  The  second  term  on  the  right  hand  side  of 

equation  (142)  is  related  to  the  F-distribution  as 


l  (Nk)pV'k  =  QCF|V  ,v2) 

k=r 


(143) 


where  =  2(N-r+l)  and  =  2r.  Combining  equations  (141,  142,  143),  and 
FH(r) (h)  =  0.975,  the  following  is  obtained 


‘  Q  (F|vi,v2)  =  0.025 


(144) 


"The  appropriate  F-distribution  table  is  used  to  determine  a  value  of  F 
based  on  and  v^,  which  in  turn  is  used  to  solve  for  p  by 


P  =  ^2/(v2  +  VPF 


The  upper  confidence  point  is  found  by 


2  2 
h  =  -a  In  p 
up 


(145) 


(146) 


"Thus,  by  varying  the  rank,  r,  appropriate  values  of  h  are  found  which 
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determine  the  upper  95%  confidence  envelope.  A  similar  approach  is  used  to 
determine  the  lower  envelope  employing  example  20  of  the  above  reference. 

"Two  modifications  to  Borgman ' s  approach  were  performed  for  this  study. 

The  first  was  to  rank  and  test  all  the  waves  in  a  record  as  opposed  to 

Borgman' s  use  of  the  top  19  waves  only.  The  second  was  to  specify  a  slope 

2 

for  the  data  by  the  use  of  a  =  8mQ,  Borgman  had  determined  a  least  squares 

fit  of  the  data  to  a  straight  line  to  determine  a^.  The  author  believes  that 

this  approach  is  a  true  test  of  the  Rayleigh  distribution  as  Borgman' s 

approach  tests  only  the  relationship  of  the  data  to  a  height  squared  fit  as 
2 

a  ,  the  slope  is  continually  changing  with  each  record  tested.  - 

"The  basic  data  record  unit  of  450  digitized  points  (7.5  minutes  of  data) 

was  tested  as  a  7.5  minute  block  and  also  combined  into  a  15  minute  block  and 

a  30  minute  block  and  then  tested.  Sample  plots  of  In  r  versus  height  squared 

for  file  1  data  are  shown  in  Figures  35  to  38.  For  four  7.5  minutes  records, 

in  Figures  39  and  40  for  two  15  minute  records  and  in  Figure  41  for  one  30 

minute  record.  (Note:  The  four  7.5  minute  records  were  combined  into  the 

two  15  minute  records  which  were  in  turn  combined  into  the  30  minute  record) . 

In  the  figures,  only  the  top  seventy  waves  were  plotted  as  crosses,  but  all 

wave  heights  within  the  record  were  checked  to  see  if  the  confidence  intervals 

2 

were  exceeded.  The  slope,  as  determined  by  a  ,  is  shown  as  a  straight  line 
for  the  first  40  points  only,  while  the  upper  and  lower  confidence  envelopes 

are  shown  as  the  two  outrider  lines.  -  As  can  be  observed,  the 

inclusion  of  all  plots  would  make  this  report  too  voluminous  and  (these  figures) 
will  be  the  only  ones  presented  for  the  Rayleigh  distribution. 


"The  fit  to  each  record  was  determined  by  comparing  h,  .  to  the  upper 


envelope  height,  ^  ,  and  the  lower  envelope  height,  h^owfr) 


If  h.  .  was 
(r) 


greater  than  h  ,  ,  or  lower  than  h,  ,  . ,  the  record  was  considered  a  bad 
6  up(r)  low(r) 

fit.  The  total  number  of  heights  falling  outside  the  confidence  envelopes 
was  determined  as  a  percent  of  the  total  number  of  wave  heights  in  the  record. 


"Figure  37,  shows  record  55,  file  1  which  is  the  only  record  shown  which 
passes  the  above  described  fitting  criteria.  Figure  36,  shows  record  54, 
file  1  in  which  data  falls  outside  both  the  upper  and  lower  confidence  levels. 
The  fact  that  most  of  the  data  falls  below  the  center  line  of  the  figure 
should  not  be  surprising  as  the  line  was  determined  theoretically  by  8  mQ  and 
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R  (PLOTTED  ON  LOG  SCALE) 


Figui'e  39 

Rayleigh  Fit,  File  1,  Record  27,  15  Minute 

R  VS  HAVE  HEIGHT  SQUARED  RECORD  27 
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Figure  40 

Rayleigh  Fit,  File  1,  Record  28,  15  Minute 

R  VS  HAVE  HEIGHT  SQUARED  RECORD  28 
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not  by  a  least  squares  fit  to  the  data.  The  upper  confidence  envelope  in 
Figure  36  is  exceeded  at  points  65  through  70,  while  data  falls  below  the 
lower  lower  confidence  envelope  for  points  11  through  30.  This  type  of 
occurrence  was  not  uncommon  and  always  occurred  with  the  upper  limit  being 
exceeded  at  the  low  wave  height  tail  of  the  record  and  the  lower  limit  being 
exceeded  at  the  start  or  middle  of  the  record.  This  is  a  mild  example  of  a 
record  exceeding  both  confidence  intervals  but  there  are  many  more  showing  a 
much  worse  fit.  Figure  40  shows  the  result  of  combining  two  7.5  minute 
records  (numbers  55  and  56),  one  of  which  failed  the  test  and  the  second 
passing  the  test,  into  one  15  minute  record  (number  28)  which  fails  the  test 
criteria.  Figure  39  shows  that  combining  two  7.5  minute  records  (numbers  53 
and  54),  which  just  barely  fail  the  test,  into  one  15  minute  record  could 
result  in  a  fit  worse  than  either  of  the  7.5  minute  fits.  Figure  41  shows 
the  four  7.5  minute  records,  or  the  two  15  minute  records,  combined  into  one 
30  minute  record  which  fails  the  test.  In  all  figures,  the  first  point 
plotted  is  right  on  the  X  axis  and  can  easily  be  spotted  as  being  the  only 
tick  mark  without  any  scale  variable  beneath  it. 

"Table  16  presents  the  results  of  this  analysis  for  the  7.5  minute 
records  of  File  1  with  the  following  notation: 

Hs:  The  significant  wave  height  as  determined  by  "bump  count". 

SWP:  The  spectral  width  parameter  as  determined  by  |T  and  T  |. 

#WAVES:  The  total  number  of  waves  of  the  record  to  be  analyzed. 

RHs:  The  significant  wave  height  as  determined  by  4.005  mQ  1/2. 

Y/N:  Fit  of  the  data  to  the  Rayleigh  distribution  (Y  =  yes,  N  =  no). 

H/L:  Where  the  data  exceeds  the  95%  confidence  envelope  (H=upper  curve, 

L= lower  curve). 

%:  Total  percent  of  points  falling  outside  the  95%  confidence  envelope. 

"Table  17,  presents  t!.o  results  for  the  15  minute  records  of  File  1  and 
Table  18  the  results  for  the  30  minute  records. 

"Summary  results  of  Tables  16,  17  and  18  are  presented  in  Tables  19,  20 

and  21.  -  The  tables  show  the  number  of  records  passing  or  failing 

by  five  foot  increments  of  significant  wave  height.  The  heading  all  points 
shows  the  number  of  records  that  had  at  least  one  point  fall  outside  the  95% 
confidence  envelope  and  the  heading  greater  than  5%  shows  the  number  of 
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records  that  had  at  least  5%  of  the  points  fall  outside  the  confidence 
envelope.  The  numbers  under  H  and  L  show  how  many  records  exceeded  the 
upper  and  lower  envelope.  If  a  record  exceeded  both  the  upper  and  lower 
confidence  envelope  (as  shown  in  Figure  17) ,  it  was  counted  under  both  H 
and  L,  thus  the  combined  value  of  H  and  L  could  exceed  the  number  of  records 
failed.  - " 

The  material  that  has  just  been  quoted  represents  a  serious  attempt 
to  find  out  whether  or  not  crest  to  trough  wave  heights  fit  the  Rayleigh 
pdf.  It  also  found  the  parameters  appropriate  to  equation  (114),  but 
these  will  not  be  considered.  Many  of  the  other  published  examples  simply 
compare  the  sample  distribution  with  the  theoretical  distribution,  or  with 
a  family  of  distributions,  and  the  lack  of  high  waves  is  considered  to  be 
prima  facie  evidence  that  the  Raleigh  pdf  is  not  a  good  fit  and  that  whatever 
new  model  is  proposed  is  superior. 

At  the  time  of  the  study  by  Spring  (1978),  which  was  a  doctoral  dis¬ 
sertation,  his  faculty  advisor  (W.  J.  Pierson),  was  unaware  of  the  implica¬ 
tions  of  the  various  assumptions  inherent  in  the  material  quoted  from  this 
work.  The  definition  of  the  sample  cdf  differs  from  the  accepted  one  given 
in  Mood,  Grabill,  and  Boes  (1963).  The  equation  for  the  distribution  of 

the  sample  cdf  is  correctly  stated,  but  the  approximation  by  the  F  distri- 

% 

bution  is  probably  valid  only  for  large  samples.  A  confidence  interval 
from  0.025  to  0.975  may  not  be  correctly  calculated  for  the  smaller  samples. 
The  value  of  mQ  (EQ)  calculated  from  the  spectrum  may  be  too  high  because  of 
digitization  noise.  The  samples  being  tested  are  not  random  samples,  and 
the  -effective  number  of  independent  wave  heights  is  less  than  the  value  of 
N  that  was  used. 

A  somewhat  smaller  value  of  mQ,  to  account  for  digitization  noise  may 
be  needed.  Wider  confidence  intervals  are  needed  to  account  for  a  smaller 
effective  sample.  They  should  be  actually  based  on  the  binomial  distribution, 
and  not  an  approximation  to  it.  These  changes  would  confine  more  of  the 
sample  cdf's  within  the  wider  acceptance  interval. 

With  these  points  in  mind,  consider  the  results  for  the  7.5  minute 
samples,  the  15  minute  samples  and  the  30  minute  samples.  For  the  180, 

7.5  minute  samples,  the  first  35  resulted  in  the  rejection  of  the  hypothesis 
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TABLE  21 


that  the  sample  was  Rayleigh  at  the  95%  level.  All  samples  went  below  the 
lower  limit  and  one  also  went  above.  For  the  second  35,  six  pass,  one  went 
outside  the  upper  bound  only,  two  were  both  above  and  below,  and  26  were 
below  and  so  on  for  the  complete  table. 

For  the  five  full  tables  for  35  7.5  minute  samples  to  summarize;  for 
the  first  35,  0  passes,  34  low  and  one  both  high-low;  for  the  next  35,  6 
passes,  26  low,  one  high,  and  2  both  high-low;  for  the  next  35,  13  passes, 
19  low,  one  high  and  3  high-low;  for  the  next  35,  21  passes,  12  low,  and 
2  both  high-low,  and  for  the  last  full  35,  15  passes,  14  low,  5  high  and 
one  both  high- low.  The  final  5  records  gave  2  passes,  2  low  and  one  high. 
The  totals  for  the  180  records  were  56  passes,  107  low,  8  high  and  9  high- 
low.  Of  the  124  that  did  not  pass,  there  were  48  for  which  only  5%  of  the 
sample  was  outside  of  the  bounds.  With  broader  bounds  to  account  for  the 
lack  of  a  random  sample,  with  the  use  of  a  binomial  pdf  to  define  these 
bounds,  and  with  the  use  of  a  somewhat  reduced  value  for  mQ,  instead  of 
31%  passing  (or  perhaps  (56  +  48)/180  which  equals  58%)  quite  a  few  might 
have  passed. 

The  15  minute  samples  can  be  interpreted  in  much  the  same  way.  The 
larger  the  sample  the  narrower  the  confidence  bounds  become  and  the  more 
sensitive  the  interpretation  to  the  effects  of  a  non-random  sample.  Of 
the  90  samples,  13  out  of  90  pass  (14%)  that  could  possibly  be  increased 
to  (13  +  22)/90  or  39%.  The  first  record  to  pass  is  record  64  suggesting 
that  the  possible  error  in  finding  mQ  is  larger  (in  percent)  for  the  lower 
waves.  Of  the  last  27  records,  48%  pass.  With  8  more  having  only  5%  of 
the  heights  outside  of  the  computed  range,  possibly  78%  could  have  passed 
as  a  result  of  these  effects. 

There  were  45  thirty  minute  samples.  The  effect  of  increased  values 
for  N,  with  the  non -randomness  of  the  sample  narrowed  the  confidence 
interval  so  that  only  4  samples,  9%,  passed.  The  percentage  of  points 
outside  of  the  confidence  interval  increased  markedly  with  only  9  under 
5%  so  that  only  29%  ((4  +  9)/45)  either  came  close  or  passed. 

The  stratification  of  the  data  into  significant  height  ranges  shows 
a  tendency  for  the  high  waves  to  fit  the  Rayleigh  but  not  the  lower  waves. 
Slightly  too  high  values  for  mQ  for  the  lower  waves  in  particular  could 
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account  for  this  effect.  This  possibility  seems  to  be  reinforced  by  Table 

15. 

The  method  of  plotting  the  data  does  no  affect  the  results  except  for 
the  redefinition  of  the  sample  cdf.  Note  that  the  point  for  R  equal  to  1  is 
a  tick  on  the  horizontal  axis.  It  has  been  marked  more  heavily  on  the  copies 
of  the  figures  from  Spring.  The  same  data  could  have  been  presented  in  a  form 
similar  to  Figure  32  to  34.  The  results  described  by  Spring  in  the  material 
quoted  closely  parallel  the  predictions  made  about  the  properties  of  small 
samples  in  the  preceding  material  of  our  study. 

These  results  need  to  be  contrasted  to  the  results  of  Donelan  and 
Pierson  (1983).  The  data  from  the  wind  wave  flume  under  known  stationary 
conditions  had  the  property  that  combining  16  samples  yielded  a  far  more 
smoothly  varying  spectrum  with  fairly  narrow  confidence  bounds.  Had  ten 
times  more  data  been  obtained  it  is  not  difficult  to  postulate  that  the 

composite  spectrum  would  have  been  smoother  and  that  the  confidence  interval 

u 

would  have  been  reduced  by  a  factor  of  (10)  .  For  the  Camille  data,  the 
combined  effect  of  non  stationarity  and  the  lack  of  a  random  sample  penalize 
large  samples  by  imposing  requirements  on  the  presumed  confidence  bounds 
that  are  too  strict. 


SUMMARY 


Data  corresponding  to  eleven  thousand  successive  crest  to  trough  wave 
heights  that  required  twenty  one  hours  and  forty  minutes  to  pass  a  Baylor 
wave  staff  at  an  offshore  oil  platform  were  studied  as  fifty  five  sets  of 
two  hundred  successive  waves.  The  spectra  for  this  sequence  of  waves 
generated  during  Hurricane  Camille  were  fairly  well  hindcast  (or  specified) 
by  the  wave  forecasting  model  described  by  Cardone,  Pierson  and  Ward  (1976) . 
Significant  heights  from  the  spectra  of  the  hindcast  model  compared  favorably 
with  significant  heights  from  the  spectra  estimated  from  the  wave  time 
histories  within  the  confidence  intervals  for  the  significant  height. 

At  the  start  of  the  period,  the  significant  heights  were  between  3  and 
4  feet.  Toward  the  end  of  the  period,  for  the  200  wave  samples,  the  signif¬ 
icant  heights  exceeded  30  feet,  and  the  last  sample  exceeded  40  feet. 

Individual  waves  over  50  feet  were  measured  for  record  48  (72.2  feet),  record 
50(65.4  feet),  record  51(68.5  feet)  and  record  55(70.4  feet). 

There  is  a  widening  gap  between  what  is  known  about  periodic  nonlinear 
waves,  periodic  nonlinear  breaking  waves,  the  various  attempts  tc  model  non¬ 
linear  irregular  waves  and  theories  for  the  interpretation  of  actual  ocean 
wave  time  histories.  Realistic  periodic  breaking  waves  car.  be  created  mathe¬ 
matically  that  are  caused  to  break  by,  among  other  effects,  applying  a  transient 
mathematical  atmospheric  pressure  field  (a  gust  of  wind?)  to  the  waves.  Witham 
(1965)  implies  that  exact  analytical  solutions  of  the  equations  of  motion  triat 
describe  real  waves  on  the  actual  ocean  are  "out  of  the  question". 

The  study  of  mechanically  generated  transient  wave  groups  has  produced 
realistic  extreme  waves  and  breaking  waves  that  can  be  exactly  predicted  after 
the  fact.  The  theories  of  Witham  do  not  seem  to  have  a  consistent  set  of 
derived  requirements,  but  they  may  be  capable  of  extention,  and  correction,  to 
account  for  random  waves  to  third  order. 

Studies  of  waves  actually  generated  by  the  wind  have  shown  that  irregu¬ 
larities  in  the  coastline  can  produce  directional  wave  spectra  that  have  values 
appropriate  to  sources  such  as  bays  and  coastline  indentations.  This  implies 
that  higher  resolution  model  in  space  and  time  will  be  needed  to  describe  waves. 
Higher  order  effects  in  wave  spectra  are  detectable  for  bound  second  harmonics, 
but  the  corrections  to  the  variance  spectra  are  two  orders  of  magnitude  lower 


than  the  first  order  peak.  Corrections,  according  to  these  results,  to  the 
phase  speeds  of  the  first  order  components  are  small.  If  to  the  contrary, 
a  Witham  -  type  theory  applies,  a  local-type  phase  speed  also  applies  that  is 
a  function  of  the  "local"  wave  height  and  "local"  frequency.  Although  the 
averages  may  verify  well,  individual  wave  groups  may  travel  at  quite  different 
speeds  and  group  velocities. 

The  theoretical  models  that  modify  the  Rayleigh  pdf  all  begin  with  the 
linear  wave  model  and  all  give  results  that  reduce  the  probabilities  of  the 
higher  waves  compared  to  the  Rayleigh.  One  new  model  theoretically  attempts 
to  account  for  wider  band  spectra  such  that  a  changing  wave  envelope  results 
in  a  regression  toward  the  middle  values  for  both  low  and  high  waves.  Another, 
of  several,  models  breaking  waves  and  concludes  that  there  is  a  deficit  of 
high  waves  compared  to  the  Rayleigh  model.  The  available  models  treat  those 
aspect  of  random  waves  that  always  result  in  a  lower  frequency  of  occurrence 
of  the  higher  waves.  Higher  order  effects  that  would  produce  higher  crest  to 
trough  heights,  even  at  third  order,  are  not  included.  The  change  in  wave 
form  as  waves  steepen  to  produce  sharp  crests  with  a  120°  ang  >' .  between  front 
and  back  (or  less)  are  not  reproduced.  It  is  not  known  whether  or  not  these 
unaccounted-for  effects  could  re -amplify  the  higher  actual  waves  so  as  to 
cancel  out  the  effects  so  far  predicted  by  the  linear  models. 

Verification  of  these  models  has  consisted  of  a  limited  number  of  sample 
height  distributions  plotted  against  the  theoretical  distributions  without 
tests  of  goodness-of-fit ,  the  consideration  of  the  required  properties  of  a 
random  sample  and  other  complications  having  to  do  with  the  actual  ways  that 
waves  are  recorded. 

The  analysis  of  these  fifty  five  samples  of  200  consecutive  waves  has 
been  restricted  to  the  time  domain  in  an  attempt  to  reconcile  the  properties 
of  sample  distributions  of  crest  to  trough  wave  heights  with  the  properties 
that  would  be  predicted  from  spectral  estimates  such  as  the  root  mean 
square  height,  the  mean  height, the  significant  height,  the  average  of  the 
one  tenth  highest  and  the  extreme  waves  in  each  sample.  In  so  doing,  an 
almost  insurmountable  difficulty  arises.  The  sample  of  200  waves  is  not  a 
random  sample. 

Although  the  fact  that  the  sample  is  not  a  random  sample  would  be 
obvious  from  the  properties  of  a  stationary  Gaussian  process,  it  is  not 


obvious  enough  to  raise  a  warning  flag  when  testing  whether  or  not  a  given 
sample  fits  a  given  model.  In  the  time  domain  alone,  the  covariance  function 
demonstrates  the  time  dependence  of  nearby  values,  but  the  time  (and  space) 
dependence  of  successive  waves  is  not  also  appreciated  as  a  corollary  of  this 
property. 

To  test  the  independence  of  successive  wave  heights,  the  heights  were 
modeled  as  a  ten  state  one  step  Markov  process.  The  model  proved  to  be  only 
approximate  in  that  powers  of  the  transitions  matrix  did  not  reproduce  the 
two  step  and  three  step  transitions.  As  a  ten  step  Markov  process,  which 
allowed  the  data  to  be  pooled  into  larger  samples  by  partially  removing 
trends,  the  results  showed  that  the  height  of  a  passing  wave  was  not  a 
useful  predictor  of  the  height  of  the  next  wave.  Any  one  of  the  ten  states 
could  follow  any  one  of  the  initial  ten  states.  There  was,  nevertheless,  a 
weak,  but  obvious,  dependence  of  the  height  of  the  following  wave  on  the  wave 
that  had  just  passed.  This  dependence  invalidates  the  assumption  that  the 
sample  is  a  random  sample. 

In  the  spectral  domain,  there  is  the  concept  of  an  effective  number  of 
independent  points  which  can  be  determined  from  the  spectral  estimate.  For 
time  correlated  wave  heights,  no  analogous  way  to  determine  the  effective 
number  of  independent  heights  in  the  sample  was  found,  but  it  is  clear  that 
the  effective  number  is  less  than  the  actual  number  and  that  the  effective 
number  decreased  for  the  waves  in  Camille  as  heights  of  the  waves  increased. 

The  available  methods  to  test  the  goodness-of-fit  of  a  samnle  to  a 
theory  that  are  applicable  to  this  problem  are  all  asymptotically  valid  for 
large  samples.  A  sample  of  200  values,  in  fact  less,  is  not  a  large  sample, 
and  so  the  tests  can  be  misleading.  For  a  small  sample,  there  is  a  demon¬ 
strable  tendency  to  miss  the  lower  and  higher  values  in  the  sample  and  to 
over  sample  the  middle  range  of  values.  Combined  with  the  smaller  samples 
possible  for  a  truncated  distribution,  this  in  turn  leads  to  highly  variable 
estimates  of  truncated  moments  that  more  often  than  not  are  too  low  due  to 
chance  alone. 

The  normalized  cumulative  density  function  for  the  highest  wave  in  each 
of  the  55  samples  did  not  have  the  properties  that  would  be  predicted  for 
a  random  sample  of  200  waves,  except  for  the  six  highest.  When  tested  against 


CONCLUSIONS  AND  RECOMMENDATIONS 


One  possible  conclusion  for  this  investigation  would  be  to  claim 
"insufficient  evidence"  and  that  no  conclusions  are  possible.  Quatrain 
XXVII  of  the  Rubaiyat  of  Omar  Khayyam  (Fitzgerald  (undated))  might  be  one 
way  for  the  reader  to  summarize  this  study  of  the  waves  that  were  measured 
during  Hurricane  Camille.  However,  the  complicated  interaction  of  numerous 
factors  in  real  data  can  lead  to  multiple  possible  causes  for  the  same 
observed  effect.  To  focus  on  one  or  two  of  these  factors  can  lead,  and  has 
lead,  to  misleading  conclusions.  The  conclusions  that  can  be  drawn  from 
this  study  must  be  highly  qualified.  The  conclusions  follow. 

1)  Samples  of  successive  crest  to  trough  ocean  wave  heights  can  be 
processed  to  give  various  statistics  in  a  number  of  different  ways. 

The  significant  wave  height  can  be  found  nine  different  ways.  Given 
the  spectrum  of  the  sample,  the  significant  wave  height  can  also  be 
found  from  the  area  under  the  spectrum.  Often  the  actual  sample 
significant  height  is  lower  than  one  computed  from  any  statistics 
based  on  the  total  sample  and  still  lower  than  the  one  computed  from 
the  spectrum. 

2)  The  tests  of  goodness-of-fit  that  compare  the  sample  to  the  Rayleigh 
pdf  usually  conclude  that  the  Rayleigh  pdf  does  not  describe  the  data. 

3)  The  five  possible  reasons  for  these  two  observed  facts  are: 

a)  The  tendency  for  the  Rayleigh  envelope  to  regress  toward 
the  mean  during  the  full  wave  cycle  if  the  spectrum  is  not 
narrow  and  thus  to  generate  a  pdf  with  a  deficit  of  both  low 
waves  and  high  waves  compared  to  the  Rayleigh. 

b)  The  breaking  of  the  higher  waves  on  the  ocean  thus  causing  . 

a  deficit  of  the  higher  waves,  with  this  deficit  being  compensated  for 
by  the  frequency  of  occurrence  of  intermediate  wave  heights. 

c)  The  difficulties  in  drawing  inferences  from  a  small  sample. 

The  probability  of  drawing  a  random  sample  that  will  include  the 
expected  number  of  sample  points  from  both  the  high  end  and  the 
low  end  of  the  theoretical  population  is  well  under  50%  so  that 
many  samples,  by  chance,  will  have  the  properties  predicted  by  (a) 


and  (b)  above. 


d)  The  small  samples  that  are  available  to  test  these  models 
are  not  random  samples.  They  are  consequently,  in  effect,  even 
smaller  than  the  sample  size  indicates.  The  use  of  the  size  cf 
the  non-random  sample  in  goodness-of-fit  tests  imposes  an  unduely 
severe  restriction  on  the  sampling  variability,  frequently  yielding 
the  rejection  of  the  hypothesis  being  tested. 

e)  The  area  under  the  variance  spectrum  of  the  ocean  wave  time 
history  may  have  been  over  estimated  because  of  digitization  errors. 

4)  It  is  not  possible  to  separate  the  five  reasons  for  what  was  observed 
with  the  present  data  set  from  Camille  although  this  data  is  very  high 
quality  and  did,  in  fact,  accomplish  the  difficult  goal  of  measuring 
very  high  waves. 

5)  The  decreases  in  the  spectral  variance  needed  to  bring  about  agreement 
with  the  wave  height  statistics  in  small.  A  more  careful  spectral  analysis 
and  an  estimate  of  the  contribution  from  various  error  sources  to  the 
total  variance  would  be  useful  supplemental  information  for  each  published 
spectrum. 

6)  Reasons  (a),(b),(c),  and  (d),  with  the  effects  for  (e)  removed  if  possible,  would 
be  indistinguishable  one  from  the  other  for  a  small  typical  sample.  The 
additional  free  parameters  from  these  theories  would  always  make  it  possible 
to  fit  many  of  the  samples  in  a  way  that  would  appear  to  be  an  improvement. 
If  the  theoretical  effects  are  actually  present,  fitting  a  sample  by  means 
of  the  additional  free  parameters  may  over-predict  the  effect  being  sought. 

7)  Nonlinear  wave  effects  that  have  not  been  considered  would  operate  in 
ways  that  would  increase  the  heights  of  the  higher  waves  in  a  sample.  It 
is  not  known  whether  or  not,  and  by  how  much, these  effects  would  cancel 
out  the  effects  predicted  in  (a)  and  (b)  above. 

8)  The  case  for  other  theoretical  probability  density  functions  that 
appear  to  be  an  improvement  on  the  Rayleigh  pdf  has  neither  been  proven 
nor  disproven  on  the  basis  of  either  the  present  literature  or  this 
analysis  of  the  Camille  data. 
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9)  More  carefully  estimated  spectra,  with  due  consideration  of  digitization 
errors  and  other  sources  of  unwanted  noise,  can  probably  provide  a  reliable  enough 
estimate  ofEQ(ormo)  to  permit  the  use  of  Rayleigh  statistics. 

10)  Apart  from  frequency  dependent  instrument  calibration  errors, 
remaining  digitization  errors  will  make  the  estimate  of  the  variance  too 
high,  and  for  design  purposes  the  estimates  will  be  conservative  because 
they  will  be  too  high. 

11)  If  the  only  available  data  consist  of  the  crest  to  trough  heights 
from  the  original  time  history  so  that  spectra  cannot  be  estimated,  it 
might  be  advisable  to  compute  Eq  both  from  the  mean  and  the  maximum  like¬ 
lihood  estimate  and  to  use  the  larger  of  the  two  to  compute  the  significant 
wave  height.  Bounds  on  the  significant  height  and  the  average  of  the  one 
tenth  highest  as  in  equations  (128)  and  (129)  with  a  correction  by  means 

of  a  guess  of  the  effective  sample  size  (tentatively  one  half  for  seas 
and  even  less  for  swell)  would,  at  least,  suggest  that  these  numbers  are 
not  well  known. 

12)  Exceptionally  high  waves  can  and  do  happen.  They  were  not  particularly 
unusual,  or  improbable,  in  the  context  of  the  eleven  thousand  waves  in  the 
total  sample  for  Hurricane  Camille.  However,  these  exceptional  waves  must 
be  very  nonlinear  and  present  theories  are  inadequate  to  describe  them  in 
the  context  of  a  non-stationary  random  process. 

13)  Both  the  Rayleigh  pdf  and  extreme  wave  theory  must  obviously  fail  if 
tested  against  an  extremly  long  duration  stationary  random  sample  of  waves. 
Those  heights  that  are  not  possible  from  physical -hydrodynamic  consider¬ 
ations  are  not  presently  known  and  were  not  revealed  by  this  study. 

From  the  above  conclusions,  some  recommendations  can  be  made.  These  recommen¬ 
dations  follow. 

1)  Spectra  estimated  from  ocean  wave  time  histories  need  to  be  checked 
for  digitization  errors  and  other  sources  of  error.  After  correction,  the 
area  under  the  spectrum,  and  its  square  root,  to  calculate  other  wave 
statistics  should  be  used. 

2)  Although  not  treated  in  this  report,  it  is  known  that  different 


organizations  use  different  spectral  modification  procedures.  Measurement 
and  data  analysis  systems  need  to  be  intercompared  and  standardized. 

3)  Extreme  waves  can  be  deliberately  modeled  in  such  a  way  that  they 
will  appear  at  a  given  place  at  a  given  time,  even  perhaps  as  part  of 
a  random  sea.  They  may  not  produce  the  worst  possible  condition  for  a 
particular  model  design,  but  the  conditions  produced,  especially  if  the 
carrier  frequency  and  the  Fourier  integral  spectrum  are  varied,  will 
provide  useful  test  data  on  extreme  conditions.  The  chance  that  the 
design  will  be  successful  under  real  conditions  will  be  increased. 

4)  Random  seas  can  be  generated  in  wind  wave  flumes  under  truly 
stationary  conditions  that  can  be  made  to  last  as  long  as  desired.  The 
sampling  variability  of  sub-samples  is  understood  in  terms  of  spectra. 

A  very  large  sample  as  in  the  work  of  Huang  and  Long  (1980) ,  except  in  larger 
faci lities,  could  eliminate  small  sample  effects  and  provide  results  that 
could,  perhaps,  be  scaled  up  to  actual  conditions. 

5)  Ocean  wave  forecasts  are  based  on  forecasting  the  spectra  and  ought 
to  be  verified  against  spectra.  A  convenient  linear  metric  for  comparing 
wave  forecasts,  and  wave  climatologies  based  on  past  wind  data,  is  four 
times  the  square  root  of  the  area  under  the  spectrum.  Such  a  metric 

is  the  first  gross  check  of  how  good  the  model  is.  Heights  that  are 
greatly  in  error  show  inadequacies  of  the  model.  Present  errors  in  wave 
forecasting  models  exceed  the  kinds  of  differences  found  in  the  various 
ways  to  compute  truncated  moments  and  in  the  corrections  that  would  be 
implied  if  any  of  the  newer  probability  models  were  used.  It  would  be 
necessary  to  predict  spectra  accurate  enough  to  specify  at  least  the 
first  and  second  moments  of  the  spectra  correctly  before  these  new  models 
could  be  used. 

6)  At  present  for  long  crested  random  waves  only,  nonlinear  models 
suggest  that  as  an  advancing  group  of  waves  coalesce  to  form  a  very  high 
wave,  the  "length"  of  the  waves  in  the  group  increases  so  as  to  preserve 
the  frequency  content  of  the  spectrum.  The  linear  theory  for  ship  motions 
in  waves  involves  the  frequency  of  encounter  and  integrals  of  the  wave 
form  along  the  hull  of  the  ship.  If  the  length  of  the  apparent  wave 
along  the  hull  of  the  ship  is  incorrect  because  of  nonlinear  effects,  the 
ship  response  will  not  be  correctly  computed. 
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